(54) CANCEROUS METASTASIS INHIBITOR

(57) A chimeric protein in which HI-8, the C-terminal domain of human urinary trypsin inhibitor (UTI) having a cancer cell metastasis inhibitory effect, is linked to a peptide containing the G domain of urokinase, which binds specifically to the urokinase receptor expressed in large amounts on cancer cells.

Description

Field of the Invention 

The invention relates to a chimeric protein in which HI-8, the C-terminal domain of human urinary trypsin inhibitor (UTI) having a cancer cell metastasis inhibitory effect, is linked to a peptide containing the G domain of urokinase, which binds specifically to the urokinase receptor expressed in large amounts on cancer cells.

Background Art 



In current cancer therapy, although advances in early diagnosis and treatment have improved the cure rate, no effective remedy against cancer metastasis has been found. Inhibition of cancer metastasis therefore remains a serious problem. Recent active research has clarified the molecular biological mechanism of cancer cell metastasis. It has been found that invasion of cancer cells into normal tissue requires the action of a variety of proteases (1) (2). Urokinase-type plasminogen activator (uPA), a serine protease, was noted early on as a protease that increases with canceration of cells (3). It is reported that the amount of uPA extracted from cancer tissue is generally correlated with the malignancy of the cancer cells (4). In addition, it is believed that secretion of precursor-type enzymes such as uPA and metalloproteases including collagenase and stromelysin, and a proteolysis cascade including an activation process of the precursor-type enzymes, are closely related to the invasion process of cancer cells (5). uPA, a glycoprotein having a molecular weight of 55 kDa, has a three-domain structure consisting of, from the N-terminal, a growth factor-like domain (G domain), a kringle domain (K domain) and a protease domain (P domain) (see Fig. 1). The G domain is the site that binds to the urokinase receptor (uPAR), a specific receptor on cells (6). It is believed that uPA binds to the membrane of cancer cells through this domain and plays an important role during invasion (7) (8) (9) (10) (11). Cancer cells also increase the uPA concentration in the direction of migration by gathering uPAR, which is capable of binding uPA, at the leading edge of migration (12). uPA bound to the cell membrane activates a variety of proteases such as plasminogen on the membrane surface and degrades extracellular matrices (13) (14) (15).

It is known that plasmin activated by uPA on the membrane surface of endothelial cells activates latent TGF-β (transforming growth factor β), which exists on the surface of mural cells (16). It is also known that TGF-β induces production of plasminogen activator inhibitor 1 (PAI-1), a selective inhibitor of uPA, and stimulates expression of uPA mRNA (17). TGF-β controls vascularization differently depending on its concentration.

In view of the foregoing, attempts have been made to inhibit metastasis of cancer cells by inhibiting the action of uPA on the cancer cell membrane. Inhibition of invasion has been reported with an antibody (18) or an inhibitor (19) against uPA, and with an antibody (21) and peptides (22) (23) that inhibit binding of uPA to uPAR.

An amino terminal fragment (ATF) of uPA (residues 1-135 of uPA) is a polypeptide comprising the G domain, which binds to uPAR, and the adjacent K domain, and competitively inhibits binding of uPA to uPAR. Crowley et al. reported that they produced a chimeric protein in which a polypeptide containing the N-terminal 137 amino acids, including ATF, is bound to the Fc region of immunoglobulin G, and that the protein inhibits metastasis of human cancer cells in vivo (24). Lu et al. prepared in yeast a chimeric protein in which ATF is bound to human serum albumin (HSA) through a spacer consisting of 4 glycines. They reported that the chimeric protein bound to uPAR in vitro and inhibited binding of uPA to the cancer cell membrane (25). These chimeric proteins were produced to stabilize in vivo the properties of ATF, which inhibits uPA binding, and to increase its metastasis inhibitory effect.

Ballance et al. report a method for producing chimeric proteins in yeast in which the G domain of uPA is bound to plasminogen activator inhibitor-2 (PAI-2), an inhibitor of uPA, or to α1-antitrypsin (α1AT), a plasmin inhibitor (26). The chimeric protein was produced to increase inhibitory potency by combining the uPAR-binding property of the G domain with the ability to inhibit enzymes involved in metastasis. However, experimental data on the metastasis inhibitory effect of this chimeric protein have not been reported.

Recently, the inventors found that human urinary trypsin inhibitor (UTI) inhibits invasion of cancer cells (27). UTI demonstrated not only an invasion inhibitory effect on cancer cells in vitro (28), but also a metastasis inhibitory effect in an in vivo model system (29). In addition, the inventors found that α2-antiplasmin (α2-AP) and α2-macroglobulin (α2M), which are plasmin inhibitors belonging to a serpin family, do not inhibit plasmin activity on the plasma membrane, whereas UTI inhibited plasmin activity on the plasma membrane, leading to inhibition of cancer cell invasion (29).

UTI comprises two Kunitz-type inhibitor domains and sugar chains (Fig. 2). The plasmin inhibitory site is located in HI-8, the second domain (residues 78-143 of UTI) on the C-terminal side of UTI (30). The inventors demonstrated that HI-8 has a metastasis inhibitory activity (31). Recent research confirmed that HI-8 also inhibits invasion and metastasis through mechanisms other than protease inhibition. HI-8 inhibits invasion by cancer cells on whose surface no plasmin activity has been demonstrated. HI-8 is believed to inhibit invasion and metastasis of cancer cells through its protease inhibitory action and also through inhibition of calcium ion influx and regulation of protein kinase C (PKC) activity.

EP 0 890 638 A2

The inventors produced crosslinked compounds in which ATF was chemically bound to UTI or HI-8, so as to improve the inhibitory effect by concentrating UTI or HI-8 on cancer cells. The crosslinked compounds were found to inhibit metastasis of cancer cells in vitro effectively (32). Compounds synthesized with a crosslinking agent, however, have a drawback for industrial application: the crosslinks form in a variety of ways, which makes large-scale production of a substance with a single structure difficult.
An inhibitor of cancerous metastasis is a drug administered together with chemotherapy when the primary tumor has been removed by operation or when surgical treatment is difficult. In such cases, patients with decreased physical strength cannot tolerate highly toxic drugs. Recently, chemotherapeutic agents have been widely re-evaluated from the viewpoint of the decrease in self-healing ability caused by side effects of anti-cancer agents and of the patients' quality of life during therapy.

UTI purified from human urine is used medicinally as a therapeutic agent for acute circulatory failure and pancreatitis. UTI is a protein whose safety in intravascular administration has already been confirmed (33) (34) (35) (36). Since HI-8 is a part of UTI, whose safety is confirmed, HI-8 is expected to be developable as a cancerous metastasis inhibitor with low toxicity to humans. In addition, the G domain of uPA, the region that binds to the receptor (uPAR) expressed in large amounts on metastatic cancer cells, both inhibits metastasis by inhibiting the binding of uPA to cancer cells and acts as a molecule that binds specifically to cancer cells. uPA has already been developed as a drug and its safety has been examined. In view of the foregoing, a chimeric protein prepared by linking a polypeptide comprising the G domain of uPA with HI-8 is expected to have an effective metastasis inhibitory action based on the combined properties of the two proteins. In addition, because the chimeric protein uses partial sequences responsible for specific functions of two drugs whose safety is established, it is expected to serve as a cancerous metastasis inhibitor with lower toxicity. Furthermore, large-scale production of the chimeric protein as a substance having a single structure of one polypeptide chain by genetic engineering techniques will contribute greatly to research on cancerous metastasis inhibition and to the development of inhibitors.

Brief Description of the Drawings 

Fig. 1 shows a primary structure of urokinase (uPA) (from fig. 7 (1), page 1712 of TAKAHASHI Takashi, KO Enki (1991). TANPAKUSHITSU KAKUSAN KOSO, 36, 1705-1715).
Fig. 2 shows a primary structure of UTI (partially modified fig. 1(B) on page 459 of YONEDA Masahiko, KIMATA Koji; SEIKAGAKU, 67:458-465, 1995).
Fig. 3 shows a primary structure of chimeric protein ATFHI.
Fig. 4 shows a structure of synthetic DNA adaptor BamHI-TaqI DNA.
Fig. 5 shows the positional relationship between the cDNA structure of uPA coding for the ATF portion and the primers used for cloning.
Fig. 6 shows the HI-8 gene of pCD17R15 and the primers used for obtaining partial DNA of HI-8 by PCR.
Fig. 7 shows a procedure to construct plasmid pTAK.
Fig. 8 shows a procedure to construct plasmid pHIK.
Fig. 9 shows a procedure to construct plasmid pAIP.
Fig. 10 shows a sequence of synthetic HI-8 DNA used to construct plasmid pCD17R15.
Fig. 11 shows a procedure to construct plasmid pCD17R15.
Fig. 12 shows a primary structure of chimeric protein ATFHI-CL.
Fig. 13 shows a procedure to construct plasmid pAIP-CL.
Fig. 14 shows a primary structure of chimeric protein ATFHI-ML.
Fig. 15 shows a procedure to construct plasmid pAIP-ML.
Fig. 16 shows a plasmin inhibition (IC50) effect of chimeric proteins.
Fig. 17 shows a binding effect of chimeric proteins to U937 cells.

Disclosure of the Invention 

The inventors worked out a molecular design that maintains the native steric structure of each domain of the chimeric protein, and a plasmid design for expressing the chimeric protein effectively in Escherichia coli. A chimeric protein expressed in E. coli can be accumulated in large amounts in the bacterial cell as an insoluble inclusion body. The chimeric protein can be collected as a single substance with its steric structure restored by a refolding treatment followed by a purification process. The chimeric protein retains both the uPAR-binding function of the G domain derived from uPA and the plasmin inhibitory function derived from HI-8. Furthermore, an in vitro cancer cell invasion inhibition experiment and an in vivo metastasis inhibition experiment confirmed that the chimeric protein has higher invasion and metastasis inhibitory activities than ATF and HI-8.

The invention provides a chimeric protein having a cancerous metastasis inhibitory activity, a DNA coding for the chimeric protein, a plasmid comprising the DNA, a transformant maintaining the plasmid, a method for producing the chimeric protein and a method for prophylaxis of cancerous metastasis.

Item 1. A chimeric protein comprising a sequence of the following (formula 1) on the N-terminal side and a sequence of the following (formula 2) on the C-terminal side:

(formula 1)

Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser
Asn Ile His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys

(formula 2)

Cys Asn Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu
Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly
Gly Cys Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg
Glu Tyr Cys



Item 2. The chimeric protein according to item 1 which further comprises an intervening sequence containing any one of the following 4 sequences between said (formula 1) and said (formula 2):

• (formula 3) -Ala Asp Gly Thr Val Ala Ala;

• (formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu Glu Leu Lys Phe Gln Gly Thr Val Ala Ala;

• Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala; and

• Glu Ile Asp Lys Ser Lys Thr Val Ala Ala.

Item 3. The chimeric protein according to item 1 comprising a sequence represented by (formula A):
N terminal-(sequence I)-(formula 1)-(sequence II)-(formula 2)-(sequence III)-C terminal (formula A)
In (formula A), (formula 1) and (formula 2) are as defined above.

(Sequence I) represents a hydrogen atom or any one of the following amino acid sequences:
Ser Asn Glu Leu His Gln Val Pro Ser Asn
Asn Glu Leu His Gln Val Pro Ser Asn
Glu Leu His Gln Val Pro Ser Asn
Leu His Gln Val Pro Ser Asn
His Gln Val Pro Ser Asn
Gln Val Pro Ser Asn
Val Pro Ser Asn
Pro Ser Asn
Ser Asn
Asn



(sequence II) represents any one of sequences selected from a group containing (formula 3) and a group not 
containing (formula 3) 

a group containing (formula 3)
(formula 3) -Ala Asp Gly Thr Val Ala Ala
(formula 3) -Ala Asp Gly Val Ala Ala
(formula 3) -Ala Asp Gly Ala Ala
(formula 3) -Ala Asp Gly Xaa
(formula 3) -Ala Asp Thr Val Ala Ala
(formula 3) -Ala Asp Val Ala Ala
(formula 3) -Ala Asp Ala Ala
(formula 3) -Ala Asp Xaa
(formula 3) -Ala Thr Val Ala Ala
(formula 3) -Ala Val Ala Ala
(formula 3) -Xaa Thr Val Ala Ala
(formula 3) -Xaa Val Ala Ala
(formula 3) -Xaa Ala Ala
(formula 3) -Xaa Xaa
(formula 3) -Val Ala Ala
(formula 3) -Xaa
(formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu Glu Leu Lys Phe Gln Gly Thr Val Ala Ala



a group not containing (formula 3)
Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala
Glu Ile Asp Lys Ser Lys Thr Val Ala Ala
Glu Ile Asp Lys Ser Lys Thr Ala Ala
Glu Ile Asp Lys Ser Lys Thr Xaa
Glu Ile Asp Lys Ser Lys Xaa
Glu Ile Asp Lys Ser Lys Val Ala Ala
Glu Ile Asp Lys Ser Lys Ala Ala
Glu Ile Asp Lys Ser Thr Val Ala Ala
Glu Ile Asp Lys Ser Val Ala Ala
Glu Ile Asp Lys Ser Ala Ala
Glu Ile Asp Lys Ser Xaa
Glu Ile Asp Lys Thr Val Ala Ala
Glu Ile Asp Lys Val Ala Ala
Glu Ile Asp Lys Ala Ala
Glu Ile Asp Lys Xaa
Glu Ile Asp Thr Val Ala Ala
Glu Ile Asp Val Ala Ala
Glu Ile Asp Ala Ala
Glu Ile Asp Xaa
Glu Ile Thr Val Ala Ala
Glu Ile Val Ala Ala
Glu Ile Ala Ala
Glu Ile Xaa
Glu Thr Val Ala Ala
Glu Val Ala Ala
Glu Ala Ala
Glu Xaa
Xaa

provided that Xaa represents any amino acid constituting a protein, and formula 3 represents the following sequence corresponding to residues 43-131 of uPA:

(formula 3)

Glu Ile Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr
Arg Gly Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp
Asn Ser Ala Thr Val Leu Gln Gln Thr Tyr His Ala His Arg Ser Asp
Ala Leu Gln Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Asn Pro Asp
Asn Arg Arg Arg Pro Trp Cys Tyr Val Gln Val Gly Leu Lys Pro Leu
Val Gln Glu Cys Met Val His Asp Cys



(Sequence III) represents a hydroxyl group (-OH) or any of the following amino acid sequences:
Gly Val Pro Gly Asp Gly Asp Glu Glu Leu Leu
Gly Val Pro Gly Asp Gly Asp Glu Glu Leu
Gly Val Pro Gly Asp Gly Asp Glu Glu
Gly Val Pro Gly Asp Gly Asp Glu
Gly Val Pro Gly Asp Gly Asp
Gly Val Pro Gly Asp Gly
Gly Val Pro Gly Asp
Gly Val Pro Gly
Gly Val Pro
Gly Val
Gly

Item 4. The chimeric protein according to item 3 wherein sequence II is

(formula 3) -Ala Asp Gly Thr Val Ala Ala
or
(formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu Glu Leu Lys Phe Gln Gly Thr Val Ala Ala

when selected from a group containing (formula 3), and sequence II is

Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala
or
Glu Ile Asp Lys Ser Lys Thr Val Ala Ala

when selected from a group not containing (formula 3).

Item 5. The chimeric protein according to item 3 wherein sequence I is represented by Ser Asn Glu Leu His Gln Val Pro Ser Asn.

Item 6. The chimeric protein according to item 3 wherein sequence I is represented by Ser Asn Glu Leu His Gln Val Pro Ser Asn, and sequence II is

(formula 3) -Ala Asp Gly Thr Val Ala Ala
or
(formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu Glu Leu Lys Phe Gln Gly Thr Val Ala Ala

when selected from a group containing (formula 3), and sequence II is

Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala
or
Glu Ile Asp Lys Ser Lys Thr Val Ala Ala

when selected from a group not containing (formula 3).

Item 7. A DNA coding for a chimeric protein comprising a sequence of the following (formula 1) on the 5' side and a sequence of the following (formula 2) on the 3' side:

(formula 1)

Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser
Asn Ile His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys

(formula 2)

Cys Asn Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu
Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly
Gly Cys Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg
Glu Tyr Cys

Item 8. The DNA according to item 7 coding for a chimeric protein comprising a sequence represented by (formula A):
N terminal-(sequence I)-(formula 1)-(sequence II)-(formula 2)-(sequence III)-C terminal (formula A)
In (formula A), (sequence I), (formula 1), (sequence II), (formula 2) and (sequence III) are as defined above.
Item 9. A plasmid comprising the DNA according to item 7 or 8.

Item 10. A transformant into which the plasmid according to item 9 is introduced.
Item 11. A cancerous metastasis inhibitor comprising the chimeric protein according to any of items 1-6 as active ingredient.
Item 12. A method for producing a chimeric protein comprising introducing into a host cell a plasmid into which the DNA according to item 7 or 8 is integrated to produce a transformant, culturing the transformant and recovering the chimeric protein from the culture.
Item 13. A method for prophylaxis of cancerous metastasis comprising administering a therapeutic amount of the chimeric protein according to any of items 1-6 to a cancer patient.
Item 14. The transformant according to item 10 wherein said transformant is FERM BP-5293.
Item 15. The transformant according to item 10 wherein said transformant is FERM BP-5745.
Item 16. The transformant according to item 10 wherein said transformant is FERM BP-5746.
Item 17. The protein according to item 1 comprising an amino acid sequence which corresponds to 1-193 of SEQ ID NO 1.
Item 18. The protein according to item 1 comprising an amino acid sequence which corresponds to 1-200 of SEQ ID NO 2.
Item 19. The protein according to item 1 comprising an amino acid sequence which corresponds to 1-207 of SEQ ID NO 3.
Item 20. The DNA according to item 7 comprising a nucleic acid sequence which corresponds to 15-593 of SEQ ID NO 1.
Item 21. The DNA according to item 7 comprising a nucleic acid sequence which corresponds to 15-614 of SEQ ID NO 2.
Item 22. The DNA according to item 7 comprising a nucleic acid sequence which corresponds to 15-635 of SEQ ID NO 3.

Xaa, representing any amino acid constituting a protein, denotes any of the 20 amino acids that constitute natural proteins.

The invention is described below in detail.

The chimeric protein which is the subject of the invention is characterised in that it is a molecule having a property (A) of binding to uPAR and a property (B) of plasmin inhibitory activity. To express property (A), the receptor-binding property of the G domain of uPA must be maintained. The sequence of the G domain (from Cys11 to Cys42 of uPA) may be modified by replacement, addition or deletion of amino acids as long as this property is maintained.

Therefore, a sequence expressing property (A) of the invention comprises the sequence of (formula 1) from Cys11 to Cys42 of uPA, corresponding to the G domain of uPA, and derivatives thereof that maintain the receptor-binding ability of uPA.

Formula 1:

Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser
Asn Ile His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys



The property (B) is derived from the Kunitz-type domain of HI-8. The domain is defined by the sequence (formula 2) from Cys5 to Cys55 of HI-8. The Kunitz-type domain exerts a metastasis inhibitory action on cancer cells based on an inhibitory action against trypsin-like enzymes such as plasmin, an inhibitory action on protein kinase C, or an inhibitory action on the influx of calcium ions. The domain may be modified by replacement, addition or deletion of amino acids as long as the metastasis inhibitory action on cancer cells is maintained by retaining at least one of these actions. Therefore, the sequence of (formula 2) includes derivatives that maintain the metastasis inhibitory action on cancer cells.

Formula 2:

Cys Asn Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu
Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly
Gly Cys Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg
Glu Tyr Cys
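As a quick consistency check on the two domain definitions, the following minimal sketch counts the residues listed for (formula 1) and (formula 2) and compares them with the stated residue ranges (Cys11 to Cys42 of uPA and Cys5 to Cys55 of HI-8). The helper names `span_length` and `summarize` are illustrative assumptions and are not part of the specification.

```python
# Residues of (formula 1) and (formula 2) in three-letter code, as listed in the text.
FORMULA_1 = (
    "Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser "
    "Asn Ile His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys"
).split()

FORMULA_2 = (
    "Cys Asn Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu "
    "Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly "
    "Gly Cys Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg "
    "Glu Tyr Cys"
).split()


def span_length(first, last):
    """Number of residues in an inclusive residue range such as Cys11..Cys42."""
    return last - first + 1


def summarize(residues):
    """Return the residue count and the number of cysteines in a sequence."""
    return {"length": len(residues), "cysteines": residues.count("Cys")}


if __name__ == "__main__":
    print("formula 1:", summarize(FORMULA_1), "expected length", span_length(11, 42))
    print("formula 2:", summarize(FORMULA_2), "expected length", span_length(5, 55))
```

Each listed sequence begins and ends with Cys, which matches the way both domains are delimited in the description.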
The sequences of (formula 1) and (formula 2) may be directly connected by a peptide bond. In the chimeric protein of the invention, however, an intervening sequence which has little or no effect on the steric structures (biological activities) of the sequences of (formula 1) and (formula 2) is preferably inserted between said (formula 1) and said (formula 2). The intervening sequence is preferably exemplified by a sequence containing any of the following 4 amino acid sequences:

• (formula 3) -Ala Asp Gly Thr Val Ala Ala;

• (formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu Glu Leu Lys Phe Gln Gly Thr Val Ala Ala;

• Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala; and

• Glu Ile Asp Lys Ser Lys Thr Val Ala Ala.

Any further sequence may be included between said four intervening sequences and said (formula 1) or (formula 2).

The desired chimeric protein of the invention is a polypeptide which has (formula 1) on the N-terminal side and (formula 2) on the C-terminal side. The polypeptide may be represented by the following (formula A):

N terminal-(sequence I)-(formula 1)-(sequence II)-(formula 2)-(sequence III)-C terminal.

Sequence I, sequence II and sequence III, which flank formula 1 and formula 2, may be any sequences as long as the domain structures (steric structures) of formula 1 and formula 2 do not interfere with each other and each of formula 1 and formula 2 maintains its functional property. Formula 1 and formula 2 are preferably amino acid sequences existing in human proteins, which will not give harmful antigenicity to the human body. Sequence I is preferably the naturally occurring N-terminal sequence of human uPA (uPA; Ser1 to Asn10). However, sequence I also includes amino acid sequences of 9 to 1 amino acids prepared by sequentially deleting amino acids one by one from the N-terminal thereof. In addition, sequence I may represent a hydrogen atom. In this case, the N-terminal starts from formula 1. Similarly, sequence III is preferably the native C-terminal sequence of HI-8 (HI-8; Gly56 to Leu66), but includes amino acid sequences of 10 to 1 amino acids prepared by sequentially deleting amino acids one by one from the C-terminal thereof. In addition, sequence III may represent a hydroxyl group (-OH). In this case, the C-terminal ends in formula 2.
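The deletion series described for sequence I and sequence III can be enumerated mechanically. The sketch below is an illustration only: the function names are assumptions, and the hydrogen atom and hydroxyl group alternatives (no flanking residues at all) are left out of the returned lists.

```python
# Native flanking sequences in three-letter code, as given in the description.
SEQ_I_NATIVE = "Ser Asn Glu Leu His Gln Val Pro Ser Asn".split()         # uPA Ser1..Asn10
SEQ_III_NATIVE = "Gly Val Pro Gly Asp Gly Asp Glu Glu Leu Leu".split()   # HI-8 Gly56..Leu66


def n_terminal_deletions(residues):
    """Full sequence first, then variants shortened one residue at a time from the N-terminal."""
    return [residues[i:] for i in range(len(residues))]


def c_terminal_deletions(residues):
    """Full sequence first, then variants shortened one residue at a time from the C-terminal."""
    return [residues[:n] for n in range(len(residues), 0, -1)]


if __name__ == "__main__":
    for variant in n_terminal_deletions(SEQ_I_NATIVE):
        print("sequence I:  ", " ".join(variant))
    for variant in c_terminal_deletions(SEQ_III_NATIVE):
        print("sequence III:", " ".join(variant))
```

Sequence I variants therefore always end in Asn, while sequence III variants always begin with Gly.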
Sequence II, which joins formula 1 and formula 2, serves as a spacer linking the two functional domains. Sequence II comprises a sequence having at least one amino acid. To exclude any influence between the two domain structures, the spacer region is preferably an amino acid sequence with a long and flexible structure. In contrast, an extraneous sequence is not preferable from the viewpoint of antigenicity. Sequence II may be a combination of the native sequence following the C-terminal of the G domain of formula 1 (sequence II-1: Glu Ile Asp Lys Ser Lys Thr) and the native N-terminal sequence of HI-8 preceding formula 2 (sequence II-2: Thr Val Ala Ala). Sequence II may also be a sequence prepared by combining an amino acid sequence of 7 to 0 amino acids obtained by sequentially deleting amino acids one by one from the C-terminal of sequence II-1 with an amino acid sequence of 4 to 0 amino acids obtained by sequentially deleting amino acids one by one from the N-terminal of sequence II-2, provided that the numbers of amino acids of sequence II-1 and of sequence II-2 are not both 0.
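The combination rule for a spacer that does not contain (formula 3) can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `spacer_candidates` is hypothetical, only the deletion rule from this paragraph is modelled, and the Xaa-containing variants listed in item 3 are not generated.

```python
from itertools import product

SEQ_II_1 = "Glu Ile Asp Lys Ser Lys Thr".split()  # follows the G domain in uPA
SEQ_II_2 = "Thr Val Ala Ala".split()              # precedes the Kunitz domain in HI-8


def spacer_candidates(left=SEQ_II_1, right=SEQ_II_2):
    """Combine C-terminal deletions of II-1 with N-terminal deletions of II-2.

    The empty/empty combination is excluded, and identical residue strings
    that arise from different combinations are reported only once.
    """
    lefts = [tuple(left[:n]) for n in range(len(left), -1, -1)]  # 7..0 residues
    rights = [tuple(right[i:]) for i in range(len(right) + 1)]   # 4..0 residues
    candidates = []
    for l_part, r_part in product(lefts, rights):
        combo = l_part + r_part
        if combo and combo not in candidates:
            candidates.append(combo)
    return candidates


if __name__ == "__main__":
    for spacer in spacer_candidates():
        print(" ".join(spacer))
```

The only duplicate under this rule is Glu Ile Asp Lys Ser Lys Thr Val Ala Ala, which arises both from full II-1 plus a shortened II-2 and from a shortened II-1 plus full II-2.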

Sequence II is preferably a sequence capable of orienting the Kunitz-type domain of HI-8 of formula 2 toward the outside of the plasma membrane when the G domain of formula 1 binds to the receptor. Sequence II may be a sequence including the K domain of uPA (formula 3: uPA; Glu43 to Cys131). Since the K domain is reported to bind to negatively charged molecules such as heparin, said sequence may stabilize the bond of the G domain to uPAR and also orient the C-terminal of the K domain toward the outside of the plasma membrane. An example of sequence II containing the K domain (formula 3) is a sequence in which formula 3 and the native sequence following the C-terminal of formula 3 (sequence II-3: Ala Asp Gly) are linked to the N-terminal sequence of HI-8 (sequence II-2: Thr Val Ala Ala). Sequence II also includes a combination of an amino acid sequence of 3 to 0 amino acids prepared by sequentially deleting amino acids one by one from the C-terminal of sequence II-3 and an amino acid sequence of 4 to 0 amino acids prepared by sequentially deleting amino acids one by one from the N-terminal of sequence II-2, provided that the numbers of amino acids of sequence II-3 and of sequence II-2 are not both 0. It is known that uPA is cleaved by plasmin between positions 135 and 136. The C-terminal region including this cleavage position originally functions as a spacer connecting to the protease domain (P domain) of uPA and includes a unique sequence (Lys Lys Pro Ser Ser Pro Pro Glu Glu). Since this sequence may be useful for expressing a spacer function by forming a specific steric structure, sequence II may include sequences wherein the 43-147 sequence of uPA is linked to the N-terminal sequence of HI-8 (sequence II-2: Thr Val Ala Ala) with Gly interposed between them.

Preferable (sequence I) is represented by Ser Asn Glu Leu His Gln Val Pro Ser Asn.

Preferable (sequence II) is

(formula 3) -Ala Asp Gly Thr Val Ala Ala
or
(formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu Glu Leu Lys Phe Gln Gly Thr Val Ala Ala

when selected from a group containing (formula 3), and is

Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala
or
Glu Ile Asp Lys Ser Lys Thr Val Ala Ala

when selected from a group not containing (formula 3).

Preferable (sequence III) is represented by

• Gly Val Pro Gly Asp Gly Asp Glu Glu Leu Leu

or

• Gly Val Pro Gly.



From the foregoing viewpoint, a novel polypeptide consisting of 193 amino acids in total, having a 134 amino acid sequence (ATF) derived from uPA on the N-terminal side and a 59 amino acid sequence derived from HI-8 on the C-terminal side, may be provided as an example of a chimeric protein that can be designed (Fig. 3). The predicted molecular weight of the chimeric protein is 21,564. The chimeric protein is hereinafter referred to as ATFHI (as shown in SEQ ID NO 1).
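The residue counts quoted for ATFHI can be cross-checked against (formula A). The sketch below assembles one combination of the preferred sequences that is consistent with the stated counts (134 + 59 = 193). It assumes that sequence III is Gly Val Pro Gly and that sequence II is (formula 3)-Ala Asp Gly Thr Val Ala Ala; the authoritative sequence is SEQ ID NO 1, which is not reproduced here, and the variable names are illustrative.

```python
def residues(text):
    """Split a three-letter-code sequence into a list of residues."""
    return text.split()


SEQ_I = residues("Ser Asn Glu Leu His Gln Val Pro Ser Asn")   # uPA 1-10
FORMULA_1 = residues(
    "Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser "
    "Asn Ile His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys"
)                                                             # uPA 11-42
FORMULA_3 = residues(
    "Glu Ile Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr "
    "Arg Gly Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp "
    "Asn Ser Ala Thr Val Leu Gln Gln Thr Tyr His Ala His Arg Ser Asp "
    "Ala Leu Gln Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Asn Pro Asp "
    "Asn Arg Arg Arg Pro Trp Cys Tyr Val Gln Val Gly Leu Lys Pro Leu "
    "Val Gln Glu Cys Met Val His Asp Cys"
)                                                             # uPA 43-131
SEQ_II_3 = residues("Ala Asp Gly")                            # follows formula 3 in uPA
SEQ_II_2 = residues("Thr Val Ala Ala")                        # precedes formula 2 in HI-8
FORMULA_2 = residues(
    "Cys Asn Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu "
    "Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly "
    "Gly Cys Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg "
    "Glu Tyr Cys"
)                                                             # HI-8 5-55
SEQ_III = residues("Gly Val Pro Gly")                         # assumed choice of sequence III

# uPA-derived part (ATF) and HI-8-derived part, joined as in (formula A).
ATF_PART = SEQ_I + FORMULA_1 + FORMULA_3 + SEQ_II_3
HI8_PART = SEQ_II_2 + FORMULA_2 + SEQ_III
ATFHI = ATF_PART + HI8_PART

if __name__ == "__main__":
    print(len(ATF_PART), len(HI8_PART), len(ATFHI))
```

Under these assumptions the part lengths agree with the 134 and 59 residue figures given for ATFHI; the molecular weight is not recomputed here.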

Other preferable chimeric proteins included in the present invention are shown in SEQ ID Nos 2 and 3. 

Biochemical properties of the protein may be calculated, and nucleic acid sequences analyzed, using analysis software such as GENETYX (SOFTWARE DEVELOPMENT).

A method for producing a protein of the invention is described below, taking ATFHI as an example. As host cells for
preparing the chimeric protein ATFHI in large amounts, eucaryotic cells such as yeast and mammalian cells, and
procaryotic cells such as E. coli, may be used. In general, a desired protein may be expressed in E. coli either by
secretion into the periplasmic fraction or by direct expression in the cytosol as an inclusion body. When secreted into
the periplasm, the desired substance may be obtained as a soluble protein having a steric structure. However, this
approach has the disadvantages that the amount of protein secreted into the periplasm is small and that the protein is
likely to be cleaved by proteases. In contrast, when the protein is directly expressed in the cytosol, the steric structure
must be reconstructed by solubilizing the inclusion body of accumulated insoluble protein with a protein solubilizer and
then refolding the protein. Since most of the inclusion body fraction is the desired protein, direct expression in the
cytosol is often used because purification and large-scale production are easy. In this case, since the DNA sequence
coding for the objective product is directly linked to an initiation methionine codon, it is necessary to remove the
N-terminal methionine from the expressed substance. It is known that N-terminal methionine is removed by methionine
aminopeptidase (MAP) from most newly generated proteins in the cytosol. Cleavage by the peptidase is greatly affected
by the type of amino acid next to the initiation methionine (37). The N-terminal amino acid of uPA is serine. When an
amino acid having a short side chain, such as serine, follows the initiation methionine, the methionine is likely to be
cleaved by MAP. It is reported that the N-terminal methionine is removed when uPA is directly expressed in E. coli (38).
Thus, ATFHI may be prepared by a direct expression method using E. coli. Methionine may also be removed when a suitable
amino acid other than serine is selected as the amino acid next to methionine.
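
The dependence of methionine removal on the residue that follows the initiation methionine can be illustrated with a short Python sketch. The set of "small" residues used here (Gly, Ala, Ser, Thr, Cys, Pro, Val) is a commonly cited rule of thumb and is an assumption of this sketch, not a statement taken from reference (37); the function name is hypothetical.

```python
# Rule-of-thumb sketch: methionine aminopeptidase (MAP) tends to remove the
# initiation Met when the next residue has a small side chain.
# SMALL_RESIDUES is an assumed, commonly cited set; it is not taken from the text.
SMALL_RESIDUES = set("GASTCPV")


def met_likely_removed(protein: str) -> bool:
    """Return True if the residue after the initiation Met is a small one."""
    if len(protein) < 2 or protein[0] != "M":
        raise ValueError("sequence must start with Met and have a second residue")
    return protein[1] in SMALL_RESIDUES


# Initiation Met followed by the N-terminal residues of uPA (Ser-Asn-Glu-Leu-...).
print(met_likely_removed("MSNELHQVPS"))
```

Such a check is only a first screen; the actual N-terminus should be confirmed experimentally by protein sequencing.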

A method for constructing a plasmid expressing a chimeric protein in E. coli is described below. When the DNA sequence
is known, DNA as starting material may easily be chemically synthesized, owing to the improved performance and
widespread availability of current DNA synthesizers. Preparation of cDNA by screening a cDNA library may also be carried
out easily using a commercially available kit. The necessary parts of the DNAs of uPA and UTI may be cloned using a
DNA cloning kit for PCR and the various types of gene libraries which are commercially available.

cDNA of uPA has already been cloned, and its gene structure was clarified by Heyneker et al. (39). Methods for
producing uPA and analogs thereof using microorganisms and animal cells have also been reported (40) (41) (42) (43).
The necessary parts of the DNA may be chemically synthesized, or cDNA of uPA may easily be isolated from a suitable
gene library by referring to the cDNA sequence described in the reports. A partial cDNA of uPA coding for the ATF
portion may be cloned by a PCR method as follows. First, suitable primer regions for PCR amplification are selected
from the uPA gene sequence around the ATF-coding region, so that the primers have equal Tm values and a GC content of
about 50%. Subsequently, a partial DNA fragment of uPA is amplified by PCR using, as template, cDNA prepared from mRNA
derived from human tissue expressing uPA (for example, kidney). The DNA fragment is cloned in E. coli by inserting the
fragment into a suitable vector using a commercially available cloning kit. The plasmid thus obtained (for example,
pPPA), comprising DNA coding for ATF, may be used as a starting material to construct an expression plasmid.
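
As a rough illustration of the primer criteria described above, the following Python sketch computes the GC content and a simple melting temperature estimate for primers Pr-1 and Pr-2 of Reference Example 1. The Wallace rule used here (2 °C per A/T, 4 °C per G/C) is an assumption of this sketch and is only a coarse estimate for primers of this length; the function names are hypothetical.

```python
# Rough primer check: GC content and a Wallace-rule melting temperature.
# The Wallace rule is a coarse approximation chosen for this sketch; it is not
# the method stated in the text, which does not say how Tm was estimated.
PRIMERS = {
    "Pr-1": "CGTGAGCGACTCCAAAGGCAGCAATG",    # SEQ ID NO 4
    "Pr-2": "AAACCAGGGCTGGTTCTCGATGGTGGTG",  # SEQ ID NO 5
}


def gc_count(seq: str) -> int:
    """Number of G and C bases in the primer."""
    return sum(1 for base in seq if base in "GC")


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the primer."""
    return gc_count(seq) / len(seq)


def wallace_tm(seq: str) -> int:
    """Wallace-rule estimate: 2 degC per A/T plus 4 degC per G/C."""
    gc = gc_count(seq)
    return 2 * (len(seq) - gc) + 4 * gc


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.1%}, Tm ~{wallace_tm(seq)} degC")
```

For a real design, a nearest-neighbor Tm calculation would normally be preferred over this estimate.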

Since the gene structure of HI-8 (UTI) has been reported (44), DNA as starting material may be obtained by a similar
cloning method. Alternatively, a plasmid (pCD17R15) disclosed in Japanese Unexamined Patent Publication H6-247998,
comprising an HI-8 DNA sequence suitable for expression in E. coli, may be used. The plasmid comprises a DNA sequence
in which codons frequently used in E. coli are employed, so as to produce HI-8 analogs in E. coli. In order to prepare
DNA of ATFHI using these plasmids as starting materials, it is important to obtain the necessary DNA fragments by a PCR
method and also to introduce suitable restriction sites in advance for the purpose of improving efficiency. It is
necessary to find an optimum combination of restriction sites and synthetic DNA so as to improve the efficiency of
expression in E. coli, as stated below.

In order to produce a desired protein in E. coli in large amounts, it is important to use a plasmid with a high
amplification number (copy number) and to use a promoter sequence and a terminator sequence which are optimum for
expression. Productivity is affected by the DNA sequence and length of the region from the Shine-Dalgarno sequence in
the ribosome binding site to the translation initiation codon ATG (SD-ATG), and by the higher-order structure of mRNA
around the translation initiation point (38). The higher-order structure of mRNA near the translation initiation point
is in turn affected by the downstream DNA sequence coding for the N-terminal amino acid sequence. Therefore, in
designing an expression plasmid it is important to consider the potential energy value of the higher-order structure of
mRNA around the N-terminal sequence, including the SD-ATG region. An optimum mRNA structure may be obtained by
replacing the natural cDNA sequence with chemically synthesized DNA coding for the SD-ATG region and several amino
acids of the N-terminal region.
cDNA of uPA has a cleavage site for restriction enzyme TaqI within the codons from the N-terminal amino acid to Ser at
position 9. The natural DNA sequence located on the 5' side of the TaqI site may therefore be replaced with chemically
synthesized DNA. As promoter, the potent tac promoter is often used, and a commercially available tac promoter sequence
(tac promoter GenBlock, Pharmacia) may be used. The promoter fragment has a BamHI cohesive end, and its 3' side
contains a Shine-Dalgarno sequence. By using this cohesive end and said TaqI site, the sequence between BamHI and TaqI
may be replaced with a chemically synthesized DNA sequence suitable for expression in E. coli (Fig. 4).
DNA coding for the chimeric protein ATFHI may be constructed by ligating the two DNA fragments of ATF and HI-8 through
recognition sites of restriction enzymes. The amino acid sequence at the linkage site corresponds to Gly at position
134 of ATF and Thr at position 1 of HI-8. A KpnI recognition site may be created by the codons corresponding to
Gly-Thr. Since no other KpnI recognition site exists in the chimeric gene of ATFHI, the site may be used as a unique
site for cleavage and for identification of the linkage site of ATF and HI-8.
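
The statement that a KpnI site can be created by the Gly-Thr codons can be checked by enumerating the codon pairs. The sketch below is a minimal illustration; it relies on the standard genetic code (Gly = GGN, Thr = ACN) and the KpnI recognition sequence GGTACC, and the function name is hypothetical.

```python
from itertools import product

# KpnI recognition sequence and the standard-code codons for Gly and Thr.
KPNI_SITE = "GGTACC"
GLY_CODONS = ["GG" + base for base in "TCAG"]
THR_CODONS = ["AC" + base for base in "TCAG"]


def junction_codons_with_site(site: str = KPNI_SITE):
    """Return the (Gly codon, Thr codon) pairs whose concatenation equals the site."""
    return [(g, t) for g, t in product(GLY_CODONS, THR_CODONS) if g + t == site]


print(junction_codons_with_site())
```

Only one of the sixteen possible codon pairs reproduces the site, so the junction codons are fixed by the choice of KpnI.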

A specific procedure for producing a plasmid is shown below. The necessary DNA fragments are prepared by a PCR method
with modified primers, using as templates said plasmids pPPA and pCD17R15, which are the starting materials for the ATF
and HI-8 DNAs. An expression plasmid may be constructed after preparing the following two intermediate plasmids.

The intermediate plasmid pTAK, comprising a DNA fragment coding for the ATF portion (Ser1-Gly134), may be produced as
shown below. When plasmid pPPA is amplified by PCR as template, a suitable DNA sequence upstream (5' side) of the TaqI
recognition site is selected as the primer on the 5' side. The primer on the 3' side is designed to generate a KpnI
recognition site on the 3' side of ATF (Fig. 5). The resulting PCR-amplified DNA is cleaved by TaqI and KpnI to obtain a
DNA fragment with cohesive ends of the restriction enzymes. This DNA fragment, a HindIII-BamHI adaptor having a tac
promoter sequence, and a BamHI-TaqI adaptor chemically synthesized to improve the efficiency of translation in E. coli
are inserted into the HindIII-KpnI site of pUC19 to produce the intermediate plasmid pTAK (Fig. 7).

An intermediate plasmid pHIK having a DNA fragment coding for HI-8 (Thr1 to Gly59) may be produced as follows.
The amino acid sequence of HI-8 encoded by pCD17R15 differs from the amino acid sequence predicted from the cDNA at the
9th and 10th positions. However, a DNA fragment encoding the same amino acid sequence as the cDNA may be obtained by
PCR amplification using a primer which changes the mutated amino acid Val at the 9th position of HI-8 encoded by
pCD17R15 to Ile, and Ile at the 10th position to Val (Fig. 6). A KpnI site is introduced on the 5' side thereof so that
it can be linked to the 3' side of ATF. Furthermore, a termination codon TGA is introduced next to Gly at the 59th
position of HI-8 using a primer designed to create a recognition site for restriction enzyme BclI on the 3' side. The
PCR-amplified DNA may be cleaved with KpnI and BclI to obtain a DNA fragment with the respective cohesive ends, which
may be inserted into the KpnI-BamHI site of pUC18 to generate the intermediate plasmid pHIK (Fig. 8).

The two intermediate plasmids pTAK and pHIK are cleaved by KpnI and XmnI, and the necessary DNA fragments are purified.
The fragments are joined by ligation to produce an expression plasmid pAIP for production of the chimeric protein
ATFHI in E. coli (Fig. 9).

A chimeric protein may be produced by using a transformant obtained by introducing the expression plasmid pAIP into a
host cell, for example E. coli (e.g. JM109). Production of ATFHI is induced by adding isopropyl
β-D-thiogalactopyranoside (IPTG) to a culture of the transformant at a suitable time. E. coli produces ATFHI as an
inclusion body. The steric structure of ATFHI may be reconstructed by well-known procedures for purifying and refolding
inclusion bodies. The reconstructed ATFHI may be purified by a combination of conventional methods for purifying
proteins, such as ion-exchange chromatography and gel filtration.

The chimeric protein of the invention may be used as a cancerous metastasis inhibitor. The chimeric protein may be
administered as injections for intravenous, intramuscular, subcutaneous, intracutaneous and intraperitoneal
administration, inhalations for intrapulmonary administration, oral medicines, suppositories, plasters, liquids and so
on. Any conventionally used carrier may be added to the preparations. The dosage per day varies with the administration
route and with the age, sex, symptoms and type of cancer of the patient, but usually ranges from about 0.1 to 200 mg
for a human adult.

Cancers whose metastasis is inhibited include leukemia, liver cancer, renal carcinoma, pancreatic cancer, esophageal
carcinoma, colon cancer, rectal cancer, malignant lymphoma, ovarian cancer, cervical cancer, brain tumor, osteosarcoma,
skin carcinoma, breast cancer and prostatic cancer.

According to the invention, a novel cancerous metastasis inhibitor with lower toxicity and a potent inhibitory effect
on invasion and metastasis of human cancer may be produced, providing a very useful drug for cancer therapy.

Best Mode for Carrying out the Invention 

Reference Example 1 : Preparation of plasmid pPPA 

DNA encoding ATF was amplified by a PCR method from commercially available cDNA and cloned in E. coli. The PCR reaction
was conducted using synthetic primers Pr-1 (5'-CGTGAGCGACTCCAAAGGCAGCAATG-3', SEQ ID NO 4) and Pr-2
(5'-AAACCAGGGCTGGTTCTCGATGGTGGTG-3', SEQ ID NO 5), with cDNA from human kidney (QUICK-Clone cDNA, CLONTECH) as
template. A commercially available PCR reaction kit (GeneAmp, Perkin Elmer Cetus) was used in a 100 µl reaction system
including 1 ng of cDNA and 50 pmol of each primer. Thirty cycles of PCR were conducted, wherein one cycle corresponded
to 94 °C for 1 minute, 55 °C for 2 minutes and 72 °C for 3 minutes. The amplified DNA of 538 bp was separated and
purified, and then inserted into the vector pCR II (Invitrogen) using a PCR product cloning kit (TA Cloning Kit,
Invitrogen). The ligation reaction and transformation were conducted according to the manual of the kit. A plasmid
retained in the resulting transformed E. coli was purified by the alkaline method (YODOSHA, IDENSIKOGAKU HANDBOOK,
pp.19-26, 1991). By examining the restriction enzyme cleavage pattern and determining the base sequence with a DNA
sequencer (ALF DNA Sequencer, Pharmacia), it was confirmed that the desired plasmid pPPA was correctly constructed.

Reference Example 2: Preparation of plasmid pCD17R15 

Each of the oligonucleotides of base sequences (1) to (10) (SEQ ID NOS 17 and 18) shown in Fig. 10 was chemically
synthesized by the phosphoramidite method with an automatic DNA synthesizer (Model 381A, Applied Biosystems).
Protective groups of the synthesized DNAs were removed by warming at 55 °C overnight in conc. aqueous ammonia. The
resulting compounds were purified using a reverse phase column for purification of oligonucleotides (OPC Cartridge
Column, Applied Biosystems). When necessary, the 5' ends of the synthetic DNAs were phosphorylated by a reaction at
37 °C for 1 hour in a solution containing 50 mM Tris-HCl (pH 7.6) with 16 units of polynucleotide kinase (TOYOBO), 1 mM
MgCl2, 0.5 mM dithiothreitol (DTT) and 1 mM ATP. The reaction mixture was then separated by polyacrylamide gel
electrophoresis (PAGE) with a gel concentration of 20% containing 7 M urea. After staining the gel with ethidium
bromide, the band containing the desired oligonucleotide was cut out on a long wavelength (365 nm) ultraviolet
illuminator. The gel slice was crushed in 1 ml of DNA eluting solution (20 mM Tris-HCl, pH 8.0, 1.5 mM EDTA), shaken at
37 °C overnight and centrifuged. The supernatant was subjected to a desalting column to obtain a synthetic
oligonucleotide solution. Complementary upper and lower chains in Fig. 10, for example the synthetic oligonucleotides of
base sequence (1) and base sequence (6), were mixed in equimolar quantities in a solution containing 50 mM Tris-HCl
(pH 7.6) and 10 mM MgCl2, and the solution was treated at 90 °C for 5 minutes. Annealing of the DNA was conducted by
allowing the solution to stand and cool slowly to room temperature. The annealed synthetic DNA fragment was separated
using urea-free PAGE with a gel concentration of 10%, and was purified from the excised gel.

A plasmid pTV118N (TAKARA) was cleaved by restriction enzymes EcoRI and KpnI. After separation by agarose gel
electrophoresis, the desired DNA band was cut out. The gel section was frozen at -80 °C for 1 hour, then quickly heated
to 37 °C and filtered with a centrifugation-type filter (Millipore) having a pore size of 0.1 µm.

The filtrate solution was extracted with phenol, and then precipitated with ethanol to purify a DNA fragment. The
DNA fragment and the 5 sets of annealed synthetic DNA fragments were mixed in a solution containing 50 mM Tris-HCl
(pH 7.6), 10 mM MgCl2, 10 mM DTT and 1 mM ATP and then ligated with 10 units of T4 DNA ligase (TAKARA) at 4 °C
overnight. Transformation was conducted using commercially available E. coli JM109 competent cells (TAKARA). A desired
plasmid was selected by separation and purification of plasmids from the transformed E. coli. The structure of the
desired plasmid was confirmed by analysis of the restriction enzyme cleavage pattern and DNA base sequence. The plasmid
thus obtained was named pEK7 (Fig. 11). The following two synthetic DNAs are complementary to each other and form a
double-stranded DNA having a BspHI cohesive end at the 5' end and a blunt end at the 3' end:

5'-CATGAAAAAAACCGCTATCGCTATCGCTGTTGCTCTGGCTGGTTTTGCTACCGTTGCTCAGGCC-3', SEQ ID NO 6;

5'-GGCCTGAGCAACGGTAGCAAAACCAGCCAGAGCAACAGCGATAGCGATAGCGGTTTTTTT-3', SEQ ID NO 7



The DNA fragments encode the amino acids of the signal peptide of E. coli outer membrane protein A (OmpA). The
double-stranded DNA fragment prepared as described above and a 0.25 kb DNA fragment generated by cleavage of plasmid
pEK7 with restriction enzymes RsaI and EcoRI were ligated into the NcoI-EcoRI site of pTV118N. Transformation of E. coli
and separation and purification of a plasmid from the transformed E. coli were conducted according to the previously
described method. It was confirmed by analysis of the restriction enzyme cleavage pattern and DNA base sequence of the
plasmid that the desired plasmid pCD17R15 was obtained.

Example 1 

Construction of expression plasmid 

(1) Construction of pTAK plasmid (Fig. 7) 

A necessary part of the DNA was obtained by PCR using plasmid pPPA as template. Synthetic primer Pr-3
(5'-GGGTACCATCTGCGCAGTCATGCAC-3', SEQ ID NO 8) was designed to create a KpnI site on the 3' side of the DNA coding for
ATF (Fig. 5). In a reaction system (100 µl) containing plasmid pPPA (10 ng) and 100 pmol each of primers Pr-1 and Pr-3,
25 cycles of PCR were conducted, wherein one cycle corresponded to 94 °C for 1 minute, 50 °C for 1 minute and 72 °C for
3 minutes. The amplified PCR product was purified by ethanol precipitation, cleaved by restriction enzymes TaqI and
KpnI and separated by 1.5% agarose gel electrophoresis. A DNA fragment of 379 bp was cut out from the gel. The DNA was
recovered using a centrifugation tube with a filter for DNA recovery (SpinBind DNA Extraction Units, FMC BioProducts).
The DNA fragment of 379 bp encodes the amino acid sequence from position 10 to position 134 of uPA (Fig. 7-(3)).

For the purpose of efficient expression of the desired product in E. coli, DNA coding for the N-terminal amino acids 1
to 9 of ATF following the initiation Met was chemically synthesized. The following two synthetic DNAs are complementary
to each other and form a BamHI cohesive end on the 5' side and a TaqI cohesive end on the 3' side:

5'-GATCCAATCAAATGAGTAATGAACTACATCAAGTACCAT-3', SEQ ID NO 9;

5'-CGATGGTACTTGATGTAGTTCATTACTCATTTGATTG-3', SEQ ID NO 10



The 5' ends of the synthetic DNAs were phosphorylated using T4 polynucleotide kinase (TAKARA) and ATP, and the DNAs
were then annealed by boiling at 100 °C for 2 minutes followed by spontaneous cooling to form an adaptor DNA. The
BamHI-TaqI adaptor encodes a 5' untranslated sequence consisting of 11 bases, the initiation codon and the following
9-amino-acid sequence from the N-terminal of ATF (Fig. 7-(2)). 100 ng of a DNA fragment prepared by cleavage of pUC19
with KpnI and HindIII followed by dephosphorylation using bacterial alkaline phosphatase (BAP, TAKARA), 20 pmol of tac
promoter DNA adaptor (tac promoter GenBlock, Pharmacia), 20 pmol of the BamHI-TaqI adaptor and 200 ng of the TaqI-KpnI
DNA fragment of 379 bp were ligated using a commercially available DNA ligation kit (DNA Ligation Kit Ver.2, TAKARA).
The product was introduced into E. coli JM109 competent cells (TAKARA) to obtain a transformant. A plasmid was prepared
from the resulting transformant. It was confirmed by analysis of the restriction enzyme cleavage pattern and DNA base
sequence that the desired plasmid pTAK was obtained.
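
The properties stated for the BamHI-TaqI adaptor (complementary strands, an 11-base 5' untranslated sequence, and an initiation codon followed by nine ATF residues) can be checked with a short Python sketch. The overhang lengths (4 nt for BamHI, 2 nt for TaqI) and the standard genetic code are the only inputs beyond SEQ ID NO 9 and 10; the helper names are hypothetical.

```python
# Sketch: check the BamHI-TaqI adaptor built from SEQ ID NO 9 and SEQ ID NO 10.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

TOP = "GATCCAATCAAATGAGTAATGAACTACATCAAGTACCAT"   # SEQ ID NO 9
BOTTOM = "CGATGGTACTTGATGTAGTTCATTACTCATTTGATTG"  # SEQ ID NO 10


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from the first base; ignore a trailing partial codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def check_adaptor(top: str, bottom: str, left_overhang: int = 4, right_overhang: int = 2):
    """Check strand pairing and read the coding sequence across the TaqI end."""
    bottom_rc = revcomp(bottom)
    # The duplex region excludes the 5' overhang of each strand.
    duplex_ok = top[left_overhang:] == bottom_rc[:-right_overhang]
    # The bottom-strand overhang supplies the last two bases of the final codon.
    coding = top + bottom_rc[-right_overhang:]
    start = coding.find("ATG")
    return {"duplex_ok": duplex_ok, "utr_length": start, "peptide": translate(coding[start:])}


print(check_adaptor(TOP, BOTTOM))
```

The translated peptide begins with the initiation Met, which is expected to be removed in E. coli as discussed in the description.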

(2) Construction of pHIK plasmid (Fig. 8)

Plasmid pCD17R15 has a DNA sequence of an HI-8 variant whose codons have been converted into codons frequently used in
E. coli to improve expression efficiency in E. coli (Japanese Unexamined Patent Publication H6-247998, Fig. 11). The
amino acid sequence of HI-8 encoded by the plasmid differs from the amino acid sequence predicted from the reported
cDNA sequence at the 9th, 10th and 61st positions. In order to ligate the DNAs of ATF and HI-8 with KpnI, a primer Pr-4
(5'-GGGTACCGTTGCTGCTTGCAACCTGCCGATTGTCCG-3', SEQ ID NO 11) was designed to change Val to Ile at the 9th position and
Ile to Val at the 10th position. A primer Pr-5 (5'-GTGATCAACCCGGAACACCGCAATATTCACGG-3', SEQ ID NO 12) was designed to
introduce a termination codon TGA at the position adjacent to Gly at the 59th position of HI-8 and simultaneously to
create a BclI recognition site. In a reaction system (100 µl) containing plasmid pCD17R15 (10 ng) as template DNA and
100 pmol each of primers Pr-4 and Pr-5, 25 cycles of PCR were conducted, wherein one cycle corresponded to 94 °C for 1
minute, 50 °C for 1 minute and 72 °C for 3 minutes. The amplified PCR product was collected by ethanol precipitation,
and then cleaved by KpnI and BclI to obtain a DNA fragment of 176 bp (Fig. 8-(1)). A vector DNA was prepared by cleaving
pUC18 with KpnI and BamHI, followed by dephosphorylation by BAP treatment (Fig. 8-(2)). 100 ng of the vector DNA and
200 ng of the DNA fragment of 176 bp were ligated using a ligation kit (Fig. 8-(3)). The product was introduced into
E. coli JM109 competent cells to isolate a transformant. A plasmid prepared from the transformant was subjected to
analysis of the DNA base sequence to confirm that the desired plasmid pHIK was constructed as designed.

(3) Construction of expression plasmid pAIP (Fig. 9)

The plasmids pTAK and pHIK were each cleaved by KpnI and XmnI and then separated by 1.0% agarose gel electrophoresis to
purify a DNA fragment of 2356 bp (Fig. 9-(1)) derived from pTAK and a DNA fragment of 997 bp (Fig. 9-(2)) derived from
pHIK. Subsequently, 100 ng of each DNA fragment were mixed and ligated using a ligation kit (Fig. 9-(3)). The product
was introduced into E. coli JM109 competent cells to isolate a transformant. A plasmid was prepared from the
transformant. It was confirmed by examining the base sequence thereof that the expression plasmid pAIP was constructed
as designed. The E. coli JM109 strain retaining pAIP was internationally deposited in the National Institute of
Bioscience and Human-Technology on November 15, 1995 as FERM BP-5293.

Example 2 : Expression of chimeric protein in E. coli 

The E. coli JM109 transformant strain retaining plasmid pAIP was inoculated into 5 ml of Terrific Broth (TB medium;
1.2% Bacto tryptone, 2.4% yeast extract, 0.4% glycerol, KH2PO4 2.31 g/l, K2HPO4 12.54 g/l) containing 100 µg/ml of
ampicillin (Amp) and cultured with shaking at 37 °C overnight. The culture was transferred into 50 ml of fresh TB medium
(100 µg/ml of Amp) and precultured for 4 hours. The preculture was then transferred to 400 ml of TB medium (100 µg/ml of
Amp) and cultivation was continued. When OD600 (absorbance of the culture medium at 600 nm) reached about 0.5, IPTG was
added to a final concentration of 0.5 mM. The mixture was further cultured overnight.

Bacterial cells were collected by centrifugation (10,000 x g, 5 minutes) and washed with lysis buffer (50 mM Tris-HCl,
pH 8.0, 50 mM NaCl, 1 mM EDTA). The cells were collected again by centrifugation and resuspended in 50 ml of lysis
buffer containing 0.25 mg/ml of lysozyme. After standing at 0 °C for 1 hour, the bacterial cells were disrupted by
sonication. The lysate was then centrifuged (4,400 x g, 5 minutes) to obtain an insoluble precipitate fraction. The
precipitate fraction was washed with lysis buffer, then with an aqueous solution of 0.5% Triton X-100 and 10 mM EDTA
(pH 8.0), and finally with lysis buffer to purify an inclusion body fraction.

The inclusion body fraction was dissolved in 20 ml of a solution of 6 M guanidine hydrochloride, 50 mM Tris-HCl
(pH 7.0), 1 mM EDTA and 1% 2-mercaptoethanol, to which one litre of refolding buffer (1 M guanidine hydrochloride, 50 mM
Tris-HCl, pH 7.0, 1 mM EDTA, 2 mM reduced glutathione, 0.2 mM oxidized glutathione) was added, and the mixture was
allowed to stand at room temperature overnight. The refolding solution was thoroughly dialyzed against 20 mM phosphate
buffer (pH 6.5) as the outer solution.
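
The effect of the dilution step on the denaturant and reducing agent can be estimated with a simple mixing calculation. The Python sketch below assumes that the volumes are additive (20 ml of solubilized inclusion bodies plus 1000 ml of refolding buffer); the function name is hypothetical.

```python
# Sketch: concentrations after mixing the solubilized inclusion bodies with
# refolding buffer, assuming the two volumes are simply additive.
def mixed_concentration(vol_a_ml: float, conc_a: float, vol_b_ml: float, conc_b: float) -> float:
    """Concentration after mixing two solutions (same unit as conc_a and conc_b)."""
    return (vol_a_ml * conc_a + vol_b_ml * conc_b) / (vol_a_ml + vol_b_ml)


SOLUBILIZED_ML = 20.0    # 6 M guanidine hydrochloride, 1% 2-mercaptoethanol
REFOLDING_ML = 1000.0    # 1 M guanidine hydrochloride, 2 mM reduced glutathione

guanidine_M = mixed_concentration(SOLUBILIZED_ML, 6.0, REFOLDING_ML, 1.0)
mercaptoethanol_pct = mixed_concentration(SOLUBILIZED_ML, 1.0, REFOLDING_ML, 0.0)
dilution_factor = (SOLUBILIZED_ML + REFOLDING_ML) / SOLUBILIZED_ML

print(f"guanidine hydrochloride: {guanidine_M:.2f} M")
print(f"2-mercaptoethanol: {mercaptoethanol_pct:.3f} %")
print(f"protein dilution: {dilution_factor:.0f}-fold")
```

Under this assumption the protein is diluted about 51-fold while the guanidine hydrochloride concentration stays close to that of the refolding buffer.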

Insoluble matter in the dialyzed refolding solution was removed with Whatman No.2 filter paper, and the solution was
further filtered with a membrane filter of pore size 0.22 µm. The resulting solution was applied to an equilibrated
ion-exchange membrane chromatography cartridge (SP MemSep 1000, MILLIPORE). Adsorbed fractions were eluted with a linear
concentration gradient of 0 to 1 M sodium chloride (20 mM phosphate buffer, pH 6.5). The fractions were concentrated
with a centrifugal ultrafilter (Centriplus concentrators, molecular weight cut-off 3,000, Amicon), and then applied to
Superdex 75 (HiLoad 26/60, Pharmacia) equilibrated with 0.2 M NaCl and 50 mM phosphate buffer (pH 6.5) for gel
filtration. Peak fractions of absorbance at 280 nm were collected and dialyzed against 20 mM phosphate buffer (pH 6.5)
as the outer solution. The resulting solution was applied to an ion-exchange column (RESOURCE S, Pharmacia) previously
equilibrated with 20 mM phosphate buffer (pH 6.5). The adsorbed ATFHI was eluted with a linear concentration gradient of
0 to 0.5 M sodium chloride (20 mM phosphate buffer, pH 6.5).

Example 3 : Confirmation of purified ATFHI 

Examination of the purified ATFHI by 20% SDS-PAGE confirmed a single band corresponding to 21.5 kDa, the molecular
weight calculated from its amino acid sequence. In addition, the protein in the electrophoresis gel was transferred to
a membrane by the western blotting method and its reactivity with antiserum was examined, which confirmed that the
protein band corresponding to 21.5 kDa reacted strongly with antiserum against HI-8. Furthermore, the expected
N-terminal sequence of ATFHI consisting of 14 amino acids, Ser-Asn-Glu-Leu-His-Gln-Val-Pro-Ser-Asn-(Cys)-Asp-(Cys)-Leu,
was confirmed, except for Cys, by determining the N-terminal amino acid sequence with a protein sequencer (Model 477A,
Applied Biosystems). The results confirmed that the initiation methionine was removed as expected when ATFHI was
directly expressed within E. coli.

Example 4 : Preparation of chimeric protein ATFHI-CL

ATFHI-CL is a chimeric protein having a polypeptide of amino acids 1-134 of uPA (Ser1 to Gly134) on the N-terminal
side and a polypeptide of 66 amino acids of HI-8 (Thr1 to Leu66) on the C-terminal side (Fig. 12; SEQ ID NO 2). An
expression plasmid pAIP-CL for preparing the chimeric protein in E. coli was produced according to the following process
(Fig. 13). Plasmid pAIP was cleaved with restriction enzymes BsmI and XbaI, and dephosphorylated by BAP treatment. A DNA
fragment of 3323 bp was purified by separating the mixture using 1% agarose gel electrophoresis (Fig. 13-(1)). The
following two synthetic DNAs are complementary to each other, and form a BsmI cohesive end on the 5' side and an XbaI
cohesive end on the 3' side:

5'-GTGAATATTGCGGTGTTCCGGGTGATGGTGATGAAGAACTGCTGTGATCCT-3', SEQ ID NO 13;

5'-CTAGAGGATCACAGCAGTTCTTCATCACCATCACCCGGAACACCGCAATATTCACGG-3', SEQ ID NO 14.

The 5' ends of the chemically synthesized DNAs were phosphorylated with T4 polynucleotide kinase (TAKARA) and ATP, and
the DNAs were boiled at 100 °C for 2 minutes and then cooled spontaneously for annealing to form an adaptor DNA
(Fig. 13-(2)). 10 pmol of the BsmI-XbaI adaptor DNA and 100 ng of the DNA fragment of 3323 bp were ligated with a
ligation kit (Fig. 13-(3)). The product was introduced into E. coli JM109 competent cells to isolate a transformant. It
was confirmed by checking the base sequence of the plasmid prepared from the transformant thus obtained that the desired
plasmid pAIP-CL was constructed as designed. The E. coli JM109 strain retaining pAIP-CL was domestically deposited in
the National Institute of Bioscience and Human-Technology located at 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken,
Japan, on December 22, 1995 as FERM P-15364, and transferred to an international deposition on November 14, 1996 as FERM
BP-5746. The E. coli carrying the plasmid was cultured and the chimeric protein ATFHI-CL was purified according to the
procedure of Example 2. It was confirmed by 20% SDS-PAGE that the purified ATFHI-CL gave a single band corresponding to
the calculated molecular weight of 22.3 kDa. It was also confirmed, by transferring the protein in the electrophoresis
gel to a membrane by the western blotting method and examining its reactivity, that the protein band corresponding to
22.3 kDa reacted strongly with antiserum against HI-8.
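
The BsmI-XbaI adaptor of this example can be checked in the same way as the BamHI-TaqI adaptor. The Python sketch below tests that SEQ ID NO 13 and 14 pair with a 2-nt overhang at the BsmI end and a 4-nt CTAG overhang at the XbaI end, and translates the upper strand. The reading frame (offset 2) is an assumption of this sketch, inferred from primer Pr-5, in which the GGT codon of Gly59 directly precedes the TGA stop codon; the helper names are hypothetical.

```python
# Sketch: check the BsmI-XbaI adaptor (SEQ ID NO 13 / 14) and its coding content.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

SEQ13 = "GTGAATATTGCGGTGTTCCGGGTGATGGTGATGAAGAACTGCTGTGATCCT"
SEQ14 = "CTAGAGGATCACAGCAGTTCTTCATCACCATCACCCGGAACACCGCAATATTCACGG"
FRAME_OFFSET = 2  # assumed reading frame, inferred from primer Pr-5


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from the first base; ignore a trailing partial codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


lower_rc = revcomp(SEQ14)
# The lower strand extends 2 nt beyond the upper strand at the BsmI end and
# carries the 4-nt 5'-CTAG overhang at the XbaI end.
duplex_ok = lower_rc[2:-4] == SEQ13
xbai_overhang = SEQ14[:4]
# Peptide encoded by the upper strand up to the stop codon.
peptide = translate(SEQ13[FRAME_OFFSET:]).split("*")[0]

print(duplex_ok, xbai_overhang, peptide)
```

Under the assumed frame, the seven residues following Gly are Asp-Gly-Asp-Glu-Glu-Leu-Leu, which agrees with a C-terminal Leu at position 66.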

Example 5 : Preparation of chimeric protein ATFHI-ML

ATFHI-ML is a chimeric protein having a polypeptide of amino acids 1-147 of uPA (Ser1 to Gln147) on the N-terminal
side and a polypeptide of amino acids 1-59 of HI-8 (Thr1 to Gly59) on the C-terminal side (Fig. 14; SEQ ID NO 3). An
expression plasmid pAIP-ML for preparing the chimeric protein in E. coli was prepared according to the following process
(Fig. 15). Plasmid pPPA was cleaved with restriction enzymes ApoI and NcoI and then separated by 3% agarose gel
electrophoresis. A DNA fragment of 236 bp was cut out and purified from the gel (Fig. 15-(1)). Plasmid pAIP was cleaved
with NcoI and KpnI and separated by 1% agarose gel electrophoresis to purify a DNA fragment of 3146 bp (Fig. 15-(2)).
The following two synthetic DNAs are complementary to each other, and form an ApoI cohesive end on the 5' side and a
KpnI cohesive end on the 3' side. The 5' ends of the chemically synthesized DNAs, 5'-AATTTCAGGGTAC-3' (SEQ ID NO 15)
and 5'-CCTAG-3' (SEQ ID NO 16), were phosphorylated with T4 polynucleotide kinase (TAKARA) and ATP, and the DNAs were
boiled at 100 °C for 2 minutes and then cooled spontaneously for annealing to form an adaptor DNA (Fig. 15-(3)).
30 pmol of the ApoI-KpnI adaptor DNA and 100 ng each of the DNA fragments of 236 bp and 3146 bp were ligated with a
ligation kit (Fig. 15-(4)). The product was introduced into E. coli JM109 competent cells to isolate a transformant. It
was confirmed by checking the base sequence of a plasmid prepared from the transformant thus obtained that the desired
plasmid pAIP-ML was constructed as designed. The E. coli JM109 strain retaining pAIP-ML was domestically deposited in
the National Institute of Bioscience and Human-Technology located at 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken,
Japan, on December 22, 1995 as FERM P-15363, and transferred to an international deposition on November 14, 1996 as
FERM BP-5745.

The E. coli carrying the plasmid was cultured and the chimeric protein ATFHI-ML was purified according to the procedure
of Example 2. It was confirmed by 20% SDS-PAGE that the purified ATFHI-ML gave a single band corresponding to the
calculated molecular weight of 23.1 kDa. It was also confirmed, by transferring the protein in the electrophoresis gel
to a membrane by the western blotting method and examining its reactivity, that the protein band corresponding to
23.1 kDa reacted strongly with antiserum against HI-8.

Example 6 : Plasmin inhibition experiment 

140 µl of PBS, 20 µl of a 6.25 nM plasmin aqueous solution and 20 µl of 0-10 µM sample were added, in this order, to
each well of a 96-well microtiter plate. After maintaining the temperature at 23 °C for 5 minutes, 20 µl of a synthetic
substrate S-2251 solution (1 mg/ml) was added thereto to start the reaction. After 30 minutes, 20 µl of 20% acetic acid
was added to stop the reaction. Absorbance at 405 nm was determined and plotted as a ratio relative to the absorbance
without addition of an inhibitor (Fig. 16). As a result, although the chimeric protein was weaker than UTI and HI-8
(1/2 to 1/3 in terms of IC50), the chimeric protein had a similar type of plasmin inhibitory activity, which confirmed
that the chimeric protein maintained the characteristics of HI-8.
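
The evaluation described above (relative absorbance against an uninhibited control, and an IC50 read from the resulting curve) can be sketched in Python as follows. All numerical values in this example are hypothetical and serve only to illustrate the calculation; linear interpolation between neighboring points is an assumption of this sketch, and the source does not state how IC50 was derived from Fig. 16.

```python
# Sketch: relative plasmin activity and IC50 by linear interpolation.
# All data below are HYPOTHETICAL illustration values, not measured results.
def final_concentration(stock: float, sample_ul: float = 20.0, total_ul: float = 200.0) -> float:
    """Sample concentration in the 200 ul reaction (140 + 20 + 20 + 20 ul)."""
    return stock * sample_ul / total_ul


def relative_activity(absorbances, control):
    """A405 of each well divided by A405 of the well without inhibitor."""
    return [a / control for a in absorbances]


def ic50_by_interpolation(concs, rel):
    """Concentration at which relative activity crosses 0.5, or None if it never does."""
    points = list(zip(concs, rel))
    for (c0, r0), (c1, r1) in zip(points, points[1:]):
        if r0 >= 0.5 >= r1 and r0 != r1:
            return c0 + (r0 - 0.5) * (c1 - c0) / (r0 - r1)
    return None


concs = [0.0, 0.1, 0.2, 0.4, 0.8]        # hypothetical final concentrations
a405 = [0.80, 0.60, 0.44, 0.36, 0.20]    # hypothetical absorbances at 405 nm
rel = relative_activity(a405, control=a405[0])

print(rel)
print(ic50_by_interpolation(concs, rel))
```

With real data, a fitted dose-response curve would usually give a more reliable IC50 than interpolation between two points.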

Example 7 : Experiment on inhibition of uPAR binding 

An experiment on inhibition of uPA binding to its receptor (uPAR) by a chimeric protein was conducted using human
histiocytic lymphoma strain U937. Fluorescein isothiocyanate (FITC)-labelled uPA as ligand was prepared as follows.
10 mg of uPA was dissolved in 2 ml of 0.1 M NaHCO3 (pH 9.0). A solution of 1 mg of FITC in 1 ml of dimethylsulphoxide
(DMSO) was added to the solution and mixed. After stirring the solution at room temperature for 3 hours, the mixture
was subjected to a gel filtration column for desalting (PD-10, Pharmacia) to purify FITC-labelled uPA. U937 cells
stimulated by phorbol 12-myristate 13-acetate (PMA, Sigma) were collected, acid-treated with 50 mM glycine-HCl and 0.5 M
NaCl (pH 3.0), and then neutralized with 0.5 M HEPES buffer and 0.1 M NaCl (pH 7.5). Endogenous uPA bound to uPAR may be
removed by this treatment. PBS (398 µl) containing 0.1% BSA, 100 µl of sample (0-1,000 nM) and 2 µl of 1 mg/ml
FITC-labelled uPA were added to 500 µl of U937 cells adjusted to 1x10^6 cells/ml (0.1% BSA, PBS). The mixture was
allowed to stand at 4 °C for 30 minutes. The amount of FITC-labelled uPA bound to the cells was determined with an EPICS
PROFILE flow cytometer. The results confirmed that the chimeric protein ATFHI had an inhibitory effect on binding of
labelled uPA similar to that of unlabelled uPA (Fig. 17), and thus that the chimeric protein maintained the G domain
function of uPA.

Example 8 : Experiment on inhibition of cancer cell invasion in vitro 

In the experiment on inhibition of invasion, cultured cells of human ovarian cancer cell line HOC-1, human
choriocarcinoma cell line SMT-cc1, human breast cancer cell line MDA-MB-435, human malignant melanoma cell line A375,
human prostatic cancer cell lines PC-3 and DU-145, human colon cancer cell line GEO and mouse Lewis lung tumor cell line
3LL were used.

so 100 |xl of Matrigel diluted 20-fold with PBS was added to a cup provided with polycarbonate filter (8nm pore size) 
(Transwell. COSTER), and dried for coating the filter surface. 600fil of RPMI 1640 and 0.1% BSA was added to a lower 
side of modified Boyden chamber. 1 00]la1 of sample whose concentration was adjusted variously with serum-free 
medium was added to an upper side of chamber (cup provided with filter). After maintaining temperature at 23°C for 1 
hour, 100*iI of cancer cell suspension (2x1 0 6 cells/ml) was added to an upper side of chamber. Fibroblast conditioned- 

55 medium as chemotactic substance was added to a lower side of chamber. The chamber was transferred to 5% C0 2 
incubator for culture at 37°C for 12 hours. Cells remained on upper side of filter were swabbed and then the filter was 
stained. The number of cells invaded into lower side of filter was counted under microscope to determine a sample con- 
centration (ID 50 ) at which the number of invasion cells were half (table 1). The experiment was independently repeated 
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3 times under the same conditions, respectively. The amount of uPAR expressed on each culture cell was determined 
by calculation of Scatchard plot using iode-labelled uPA. 
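
The ID50 read-out described above can be sketched as a log-linear interpolation between the two sample concentrations that bracket half of the control count. This is only a minimal sketch: the function name `id50`, the interpolation scheme and the example counts are illustrative assumptions, not the procedure or data actually used in the experiment.

```python
import math

def id50(concs_nM, counts, control_count):
    """Estimate the concentration giving 50% of the control invasion count.

    concs_nM      -- sample concentrations (nM), all > 0   (hypothetical input)
    counts        -- invaded-cell counts at those concentrations
    control_count -- invaded-cell count without sample
    Returns None when the counts never cross the 50% level.
    """
    half = control_count / 2.0
    points = sorted(zip(concs_nM, counts))
    for (c_lo, n_lo), (c_hi, n_hi) in zip(points, points[1:]):
        # Look for the interval in which the count falls through the 50% level.
        if n_lo >= half >= n_hi and n_lo != n_hi:
            frac = (n_lo - half) / (n_lo - n_hi)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None

# Made-up counts for illustration only.
example = id50([1, 10, 100], [100, 60, 40], control_count=100)
print(round(example, 1))
```

Interpolating on a logarithmic concentration axis is a common convention for dose-response data; a fitted sigmoid curve would be an equally reasonable choice.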

The results demonstrate that, on the mouse-derived cancer cell line 3LL, the chimeric protein has effects similar to
those of UTI and HI-8, whereas on human cancer cells it has more potent invasion inhibitory effects than UTI and HI-8.
This confirms that the chimeric protein binds specifically to human uPAR. The chimeric protein is also more potent
than a crosslinked compound (ATF + HI-8 conjugate in Table 1) prepared by linking ATF and HI-8 with a crosslinking
agent (N-succinimidyl-3-(2-pyridyldithio)propionate) (32). The inhibitory effect of the chimeric protein increases in
proportion to the amount of uPAR expressed on each cell: it is highest on cells with a large amount of expressed uPAR,
such as SMT-cc1 and DU-145.

Table 1

Invasion inhibitory effect on each culture cell ID50 (nM)

cell        UTI   HI-8  ATF     ATF+HI-8   ATFHI  ATFHI-CL  ATFHI-ML  uPAR
                                conjugate                             (sites/cell)
HOC-1       200   180   1000    70         10     20        50        76000
SMT-cc1     100   220   500     10         1.2    1         10        108000
A375        80    100   1000    80         50     100       120       12000
MDA-MB-435  70    110   800     30         3.5    20        5.6       87000
GEO         300   200   >1000   150        110    200       20        5000
PC-3        50    50    >1000   20         20     20        50        20000
DU-145      260   150   300     30         0.5    5         2.9       96000
3LL         250   200   >1000   200        300    300       200       N.D.
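
The relationship between uPAR expression and the potency of the chimeric protein can be checked against the Table 1 values with a rank correlation. The sketch below uses the ID50 values of HI-8 and ATFHI and the uPAR site counts for the seven human cell lines; the choice of Spearman's coefficient and the "fold gain" measure (ID50 of HI-8 divided by ID50 of ATFHI) are assumptions made for illustration and are not part of the original analysis.

```python
# (cell line, ID50 of HI-8 in nM, ID50 of ATFHI in nM, uPAR sites per cell), from Table 1
TABLE_1 = [
    ("HOC-1", 180, 10, 76000),
    ("SMT-cc1", 220, 1.2, 108000),
    ("A375", 100, 50, 12000),
    ("MDA-MB-435", 110, 3.5, 87000),
    ("GEO", 200, 110, 5000),
    ("PC-3", 50, 20, 20000),
    ("DU-145", 150, 0.5, 96000),
]

def ranks(values):
    """Rank values from 1 (smallest) upwards; assumes there are no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result

def spearman(x, y):
    """Spearman rank correlation for tie-free data."""
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d2 / (n * (n * n - 1))

fold_gain = [hi8 / atfhi for _, hi8, atfhi, _ in TABLE_1]
upar_sites = [sites for _, _, _, sites in TABLE_1]
rho = spearman(upar_sites, fold_gain)
print(f"Spearman rho = {rho:.3f}")
```

A coefficient close to 1 indicates that cell lines with more uPAR sites show a larger gain from the chimeric protein, which is consistent with the ordering reported in the text; it does not by itself establish strict proportionality.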



Example 9 : Experiment on metastasis inhibition of human cancer cells in nude mice

Nude mice (Balb/c nu/nu, Charles River Japan) were kept in a sterilized room and given sterilized food and water.
A suspension of 1 x 10^7 prostatic cancer PC-3 cells in 0.2 ml of Dulbecco's modified Eagle medium (DMEM) was
transplanted subcutaneously into 5-week-old male mice. 50 µg of ATFHI or physiological saline was injected
subcutaneously on days 0, 7 and 14 after transplantation. Six weeks after transplantation of the tumor, the
subcutaneous tumor was surgically removed and lymph node metastasis was examined. As shown in Table 2, lymph node
metastasis was observed in about half of the saline-treated tumor-inoculated mice (16/31), and ATFHI inhibited the
metastasis significantly (3/20).

Table 2

Sample   Cell   Method of Inoculation   Cell Number   Metastasis Population
Saline   PC-3   s.c.                    1 x 10^7      16/31
ATFHI    PC-3   s.c.                    1 x 10^7      3/20
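
The text does not say which statistical test supports the word "significantly". As an illustration only, the counts of Table 2 can be put through a two-sided Fisher's exact test; the choice of test and the function name `fisher_exact_two_sided` are assumptions, not the authors' stated method.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    return sum(prob(x) for x in range(lowest, highest + 1)
               if prob(x) <= p_observed * (1 + 1e-9))

# Saline: 16 of 31 mice with lymph node metastasis; ATFHI: 3 of 20 (Table 2).
p_value = fisher_exact_two_sided(16, 31 - 16, 3, 20 - 3)
print(f"two-sided p = {p_value:.4f}")
```

Under this assumed test the difference between the two groups falls below the conventional 0.05 threshold.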



Example 10 : Cytotoxicity in vitro

Cytotoxicity of ATFHI to uPAR-expressing cells was examined by observing growth inhibition of cultured cells.
2 x 10^4 cultured cells (HOC-1, SMT-cc1, PC-3, 3LL) were cultured overnight in a 96-well plate. The medium was changed
to a leucine(-) medium containing various concentrations of ATFHI, ATF or HI-8. After culture at 37°C for 20 hours,
1 µCi of [3H]leucine was added, and the cells were cultured for a further 6 hours. The collected cells were disrupted
by freezing and thawing, and the radioactive leucine incorporated during protein synthesis was measured with a
Betaplate scintillation counter (Pharmacia). The results indicate that ATFHI does not kill cells at a concentration of
20 µg/ml (about 1 µM) and does not affect protein synthesis.
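
The equivalence of 20 µg/ml and about 1 µM follows from the molecular weight of ATFHI. A small sketch of the conversion, assuming the molecular weight of 21564.13 given for ATFHI in Fig. 3 (the helper name is hypothetical):

```python
ATFHI_MOLECULAR_WEIGHT = 21564.13  # g/mol, value listed for ATFHI in Fig. 3

def ug_per_ml_to_micromolar(conc_ug_per_ml, molecular_weight):
    """Convert a mass concentration (ug/ml, i.e. mg/l) to a molar one (uM)."""
    millimolar = conc_ug_per_ml / molecular_weight  # (mg/l) / (g/mol) = mmol/l
    return millimolar * 1000.0                      # mmol/l -> umol/l

atfhi_uM = ug_per_ml_to_micromolar(20.0, ATFHI_MOLECULAR_WEIGHT)
print(f"20 ug/ml ATFHI = {atfhi_uM:.2f} uM")
```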

Example 11 : Activation of cell growth in vitro

1 ml of cell suspension containing 1,000 cancer cells (HOC-1, SMT-cc1, PC-3, 3LL) was placed in each well of a
24-well plate and cultured. After 24 hours, various concentrations of ATFHI diluted with PBS containing 0.2% human
serum albumin were added. After culture for a further 7 days, the cells were stained and observed under a microscope.
The results confirm that the chimeric protein does not activate the growth of cancer cells.

Sequence Listing
SEQ ID NO : 1
SEQUENCE LENGTH: 624
SEQUENCE TYPE: nucleic acid
STRANDNESS: double
TOPOLOGY: linear
SEQUENCE :

GATCCAATCA A ATG AGT AAT GAA CTA CAT CAA GTA CCA TCG AAC TGT GAC 50
Met Ser Asn Glu Leu His Gln Val Pro Ser Asn Cys Asp
-1 1 5 10

TGT CTA AAT GGA GGA ACA TGT GTG TCC AAC AAG TAC TTC TCC AAC ATT 98
Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser Asn Ile
15 20 25

CAC TGG TGC AAC TGC CCA AAG AAA TTC GGA GGG CAG CAC TGT GAA ATA 146
His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys Glu Ile
30 35 40

GAT AAG TCA AAA ACC TGC TAT GAG GGG AAT GGT CAC TTT TAC CGA GGA 194
Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr Arg Gly
45 50 55 60

AAG GCC AGC ACT GAC ACC ATG GGC CGG CCC TGC CTG CCC TGG AAC TCT 242
Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp Asn Ser
65 70 75

GCC ACT GTC CTT CAG CAA ACG TAC CAT GCC CAC AGA TCT GAT GCT CTT 290
Ala Thr Val Leu Gln Gln Thr Tyr His Ala His Arg Ser Asp Ala Leu
80 85 90

CAG CTG GGC CTG GGG AAA CAT AAT TAC TGC AGG AAC CCA GAC AAC CGG 338
Gln Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Asn Pro Asp Asn Arg
95 100 105

AGG CGA CCC TGG TGC TAT GTG CAG GTG GGC CTA AAG CCG CTT GTC CAA 386
Arg Arg Pro Trp Cys Tyr Val Gln Val Gly Leu Lys Pro Leu Val Gln
110 115 120

GAG TGC ATG GTG CAT GAC TGC GCA GAT GGT ACC GTT GCT GCT TGC AAC 434
Glu Cys Met Val His Asp Cys Ala Asp Gly Thr Val Ala Ala Cys Asn
125 130 135 140

CTG CCG ATT GTC CGT GGT CCG TGC CGT GCT TTC ATC CAG CTG TGG GCT 482
Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu Trp Ala
145 150 155

TTC GAC GCT GTT AAA GGT AAA TGC GTT CTG TTC CCG TAT GGT GGT TGC 530
Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly Gly Cys
160 165 170

CAG GGT AAC GGT AAC AAA TTC TAT TCT GAA AAA GAA TGC CGT GAA TAT 578
Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg Glu Tyr
175 180 185

TGC GGT GTT CCG GGT TGATCCTCTAGAGTCGACCTGCAGGCATGCA 624
Cys Gly Val Pro Gly
190 193
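
The correspondence between the nucleotide and amino acid lines of SEQ ID NO: 1 can be checked by translating the coding region with the standard genetic code. The sketch below transcribes positions 15 to 596 (the mature coding region plus the stop codon). The transcription assumes that the printed translation line is correct where the scanned nucleotide line is ambiguous (codons read as AGT for Ser 1, TGT for Cys 11 and TAC TGC AGG for Tyr-Cys-Arg 101 to 103); the variable and function names are illustrative.

```python
# Standard genetic code with bases ordered T, C, A, G (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

# Coding region of SEQ ID NO: 1 from the Ser 1 codon to the stop codon (assumed reading).
CDS = "".join("""
AGT AAT GAA CTA CAT CAA GTA CCA TCG AAC TGT GAC
TGT CTA AAT GGA GGA ACA TGT GTG TCC AAC AAG TAC TTC TCC AAC ATT
CAC TGG TGC AAC TGC CCA AAG AAA TTC GGA GGG CAG CAC TGT GAA ATA
GAT AAG TCA AAA ACC TGC TAT GAG GGG AAT GGT CAC TTT TAC CGA GGA
AAG GCC AGC ACT GAC ACC ATG GGC CGG CCC TGC CTG CCC TGG AAC TCT
GCC ACT GTC CTT CAG CAA ACG TAC CAT GCC CAC AGA TCT GAT GCT CTT
CAG CTG GGC CTG GGG AAA CAT AAT TAC TGC AGG AAC CCA GAC AAC CGG
AGG CGA CCC TGG TGC TAT GTG CAG GTG GGC CTA AAG CCG CTT GTC CAA
GAG TGC ATG GTG CAT GAC TGC GCA GAT GGT ACC GTT GCT GCT TGC AAC
CTG CCG ATT GTC CGT GGT CCG TGC CGT GCT TTC ATC CAG CTG TGG GCT
TTC GAC GCT GTT AAA GGT AAA TGC GTT CTG TTC CCG TAT GGT GGT TGC
CAG GGT AAC GGT AAC AAA TTC TAT TCT GAA AAA GAA TGC CGT GAA TAT
TGC GGT GTT CCG GGT TGA
""".split())

def translate(dna):
    """Translate codon by codon and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)

protein = translate(CDS)
print(len(protein), protein[:13], protein[-5:])
```

The translated length can be compared with the 193 residues numbered in the listing, and the same check applies to SEQ ID NO: 2 and 3 with their own coding regions.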



SEQ ID NO : 2
SEQUENCE LENGTH: 645
SEQUENCE TYPE: nucleic acid
STRANDNESS: double
TOPOLOGY: linear
SEQUENCE :

GATCCAATCA A ATG AGT AAT GAA CTA CAT CAA GTA CCA TCG AAC TGT GAC 50
Met Ser Asn Glu Leu His Gln Val Pro Ser Asn Cys Asp
-1 1 5 10

TGT CTA AAT GGA GGA ACA TGT GTG TCC AAC AAG TAC TTC TCC AAC ATT 98
Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser Asn Ile
15 20 25

CAC TGG TGC AAC TGC CCA AAG AAA TTC GGA GGG CAG CAC TGT GAA ATA 146
His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys Glu Ile
30 35 40

GAT AAG TCA AAA ACC TGC TAT GAG GGG AAT GGT CAC TTT TAC CGA GGA 194
Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr Arg Gly
45 50 55 60

AAG GCC AGC ACT GAC ACC ATG GGC CGG CCC TGC CTG CCC TGG AAC TCT 242
Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp Asn Ser
65 70 75

GCC ACT GTC CTT CAG CAA ACG TAC CAT GCC CAC AGA TCT GAT GCT CTT 290
Ala Thr Val Leu Gln Gln Thr Tyr His Ala His Arg Ser Asp Ala Leu
80 85 90

CAG CTG GGC CTG GGG AAA CAT AAT TAC TGC AGG AAC CCA GAC AAC CGG 338
Gln Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Asn Pro Asp Asn Arg
95 100 105

AGG CGA CCC TGG TGC TAT GTG CAG GTG GGC CTA AAG CCG CTT GTC CAA 386
Arg Arg Pro Trp Cys Tyr Val Gln Val Gly Leu Lys Pro Leu Val Gln
110 115 120

GAG TGC ATG GTG CAT GAC TGC GCA GAT GGT ACC GTT GCT GCT TGC AAC 434
Glu Cys Met Val His Asp Cys Ala Asp Gly Thr Val Ala Ala Cys Asn
125 130 135 140

CTG CCG ATT GTC CGT GGT CCG TGC CGT GCT TTC ATC CAG CTG TGG GCT 482
Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu Trp Ala
145 150 155

TTC GAC GCT GTT AAA GGT AAA TGC GTT CTG TTC CCG TAT GGT GGT TGC 530
Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly Gly Cys
160 165 170

CAG GGT AAC GGT AAC AAA TTC TAT TCT GAA AAA GAA TGC CGT GAA TAT 578
Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg Glu Tyr
175 180 185

TGC GGT GTT CCG GGT GAT GGT GAT GAA GAA CTG CTG TGATCCTCTAGAGTCG 630
Cys Gly Val Pro Gly Asp Gly Asp Glu Glu Leu Leu
190 195 200

ACCTGCAGGCATGCA 645

SEQ ID NO : 3
SEQUENCE LENGTH: 666
SEQUENCE TYPE: nucleic acid
STRANDNESS: double
TOPOLOGY: linear
SEQUENCE :

GATCCAATCA A ATG AGT AAT GAA CTA CAT CAA GTA CCA TCG AAC TGT GAC 50
Met Ser Asn Glu Leu His Gln Val Pro Ser Asn Cys Asp
-1 1 5 10

TGT CTA AAT GGA GGA ACA TGT GTG TCC AAC AAG TAC TTC TCC AAC ATT 98
Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser Asn Ile
15 20 25

CAC TGG TGC AAC TGC CCA AAG AAA TTC GGA GGG CAG CAC TGT GAA ATA 146
His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys Glu Ile
30 35 40

GAT AAG TCA AAA ACC TGC TAT GAG GGG AAT GGT CAC TTT TAC CGA GGA 194
Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr Arg Gly
45 50 55 60

AAG GCC AGC ACT GAC ACC ATG GGC CGG CCC TGC CTG CCC TGG AAC TCT 242
Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp Asn Ser
65 70 75

GCC ACT GTC CTT CAG CAA ACG TAC CAT GCC CAC AGA TCT GAT GCT CTT 290
Ala Thr Val Leu Gln Gln Thr Tyr His Ala His Arg Ser Asp Ala Leu
80 85 90

CAG CTG GGC CTG GGG AAA CAT AAT TAC TGC AGG AAC CCA GAC AAC CGG 338
Gln Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Asn Pro Asp Asn Arg
95 100 105

AGG CGA CCC TGG TGC TAT GTG CAG GTG GGC CTA AAG CCG CTT GTC CAA 386
Arg Arg Pro Trp Cys Tyr Val Gln Val Gly Leu Lys Pro Leu Val Gln
110 115 120

GAG TGC ATG GTG CAT GAC TGC GCA GAT GGA AAA AAG CCC TCC TCT CCT 434
Glu Cys Met Val His Asp Cys Ala Asp Gly Lys Lys Pro Ser Ser Pro
125 130 135 140

CCA GAA GAA TTA AAA TTT CAG GGT ACC GTT GCT GCT TGC AAC CTG CCG 482
Pro Glu Glu Leu Lys Phe Gln Gly Thr Val Ala Ala Cys Asn Leu Pro
145 150 155

ATT GTC CGT GGT CCG TGC CGT GCT TTC ATC CAG CTG TGG GCT TTC GAC 530
Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu Trp Ala Phe Asp
160 165 170

GCT GTT AAA GGT AAA TGC GTT CTG TTC CCG TAT GGT GGT TGC CAG GGT 578
Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly Gly Cys Gln Gly
175 180 185

AAC GGT AAC AAA TTC TAT TCT GAA AAA GAA TGC CGT GAA TAT TGC GGT 626
Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg Glu Tyr Cys Gly
190 195 200

GTT CCG GGT TGATCCTCTAGAGTCGACCTGCAGGCATGCA 666
Val Pro Gly
205 207






EP 0 890 638 A2 



SEQ ID NO :4 
SEQUENCE LENGTH: 26
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE: 

CGTGAGCGAC TCCAAAGGCA GCAATG 

SEQ ID NO :5 
SEQUENCE LENGTH: 28 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE : 

AAACCAGGGC TGGTTCTCGA TGGTGGTG 

SEQ ID NO :6 
SEQUENCE LENGTH: 64 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE :

CATGAAAAAA ACCGCTATCG CTATCGCTGT TGCTCTGGCT GGTTTTGCTA CCGTTGCTCA 60
GGCC 64

SEQ ID NO :7 
SEQUENCE LENGTH: 60 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE : 

GGCCTGAGCA ACGGTAGCAA AACCAGCCAG AGCAACAGCG ATAGCGATAG CGGTTTTTTT 60 
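
SEQ ID NO: 6 and SEQ ID NO: 7 read as the two strands of one synthetic duplex. A quick way to confirm this is to compare the reverse complement of one strand with the other. The sketch below does so; it assumes that the illegible character in the scanned SEQ ID NO: 6 is G (which is what perfect pairing with SEQ ID NO: 7 requires), and the helper name is illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(dna):
    """Return the reverse complement of a DNA string written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]

# Sequences written 5' to 3' as in the listing (spaces removed).
SEQ_6 = "CATGAAAAAAACCGCTATCGCTATCGCTGTTGCTCTGGCTGGTTTTGCTACCGTTGCTCAGGCC"
SEQ_7 = "GGCCTGAGCAACGGTAGCAAAACCAGCCAGAGCAACAGCGATAGCGATAGCGGTTTTTTT"

paired_region = reverse_complement(SEQ_7)
overhang = SEQ_6[:len(SEQ_6) - len(SEQ_7)]

print("SEQ 7 pairs with SEQ 6 after the overhang:", SEQ_6.endswith(paired_region))
print("unpaired 5' overhang of SEQ 6:", overhang)
```

The four unpaired bases at the 5' end of SEQ ID NO: 6 leave a single-stranded end on the annealed duplex, while the other end is flush.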



SEQ ID NO :8 
SEQUENCE LENGTH: 25 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear
SEQUENCE : 

GGGTACCATC TGCGCAGTCA TGCAC 25 

SEQ ID NO :9
SEQUENCE LENGTH: 39
SEQUENCE TYPE: nucleic acid
STRANDNESS: single
TOPOLOGY: linear
SEQUENCE :

GATCCAATCA AATGAGTAAT GAACTACATC AAGTACCAT 

SEQ ID NO :10 
SEQUENCE LENGTH: 37 
SEQUENCE TYPE: nucleic acid 
STRANDNESS : single 
TOPOLOGY: linear 
SEQUENCE : 

CGATGGTACT TGATGTAGTT CATTACTCAT TTGATTG 

SEQ ID NO :11 
SEQUENCE LENGTH: 36 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE : 

GGGTACCGTT GCTGCTTGCA ACCTGCCGAT TGTCCG 

SEQ ID NO :12
SEQUENCE LENGTH: 32
SEQUENCE TYPE: nucleic acid 
STRANDNESS : single 
TOPOLOGY: linear 
SEQUENCE : 

GTGATCAACC CGGAACACCG CAATATTCAC GG 

SEQ ID NO :13 
SEQUENCE LENGTH: 51 
SEQUENCE TYPE: nucleic acid 
STRANDNESS : single 
TOPOLOGY: linear 
SEQUENCE : 

GTGAATATTG CGGTGTTCCG GGTGATGGTG ATGAAGAACT GCTGTGATCC T 

SEQ ID NO : 14 
SEQUENCE LENGTH: 57 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE : 

CTAGAGGATC ACAGCAGTTC TTCATCACCA TCACCCGGAA CACCGCAATA TTCACGG

SEQ ID NO :15
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE : 
AATTTCAGGG TAC 

SEQ ID NO : 16 
SEQUENCE LENGTH: 5 
SEQUENCE TYPE: nucleic acid 
STRANDNESS: single 
TOPOLOGY: linear 
SEQUENCE : 
CC TAG 

SEQ ID NO : 17
SEQUENCE LENGTH: 253
SEQUENCE TYPE: nucleic acid
STRANDNESS: double
TOPOLOGY: linear
SEQUENCE :

GGTTGCTGCT TGCAACCTGC CGGTTATCCG TGGTCCGTGC CGTGCTTTCA TCCAGCTGTG 60
GGCTTTCGAC GCTGTTAAAG GTAAATGCGT TCTGTTCCCG TATGGTGGTT GCCAGGGTAA 120
CGGTAACAAA TTCTATTCTG AAAAAGAATG CCGTGAATAT TGCGGTGTTC CGGGTGACGA 180
AGACGAAGAA CTGCTGTGAT GATCTAGAGC CCAGCCCGCC TAATGAGCGG GCTTTTTTTT 240
GAACAAAAGG CGG 253

SEQ ID NO : 18 
SEQUENCE LENGTH: 261
SEQUENCE TYPE: nucleic acid 
STRANDNESS: double 
TOPOLOGY: linear 
SEQUENCE: 

AATTCCGCCT TTTGTTCAAA AAAAAGCCCG CTCATTAGGC GGGCTGGGCT CTAGATCATC 60 
ACAGCAGTTC TTCGTCTTCG TCACCCGGAA CACCGCAATA TTCACGGCAT TCTTTTTCAG 120 
AATAGAATTT GTTACCGTTA CCCTGGCAAC CACCATACGG GAACAGAACG CATTTACCTT 180 
TAACAGCGTC GAAAGCCCAC AGCTGGATGA AAGCACGGCA CGGACCACGG ATAACCGGCA 240
GGTTGCAAGC AGCAACCGTA C 261 

Claims 

1. A chimeric protein comprising a sequence of the following (formula 1) on N-terminal side and a sequence of the
following (formula 2) on C-terminal side:

(formula 1)

Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser
Asn Ile His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys

(formula 2)

Cys Asn Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu
Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly
Gly Cys Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg
Glu Tyr Cys

2. The chimeric protein according to claim 1 which further comprises an intervening sequence containing any one of
the following 4 sequences between said (formula 1) and said (formula 2):

• (formula 3) -Ala Asp Gly Thr Val Ala Ala;

• (formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu
Glu Leu Lys Phe Gln Gly Thr Val Ala Ala;

• Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala; and

• Glu Ile Asp Lys Ser Lys Thr Val Ala Ala.



3. The chimeric protein according to claim 1 comprising a sequence represented by (formula A):

N terminal-(sequence I)-(formula 1)-(sequence II)-(formula 2)-(sequence III)-C terminal (formula A)

in (formula A), (formula 1) and (formula 2) are as defined above.

(Sequence I) represents a hydrogen atom or any one of the following amino acid sequences:

Ser Asn Glu Leu His Gln Val Pro Ser Asn
Asn Glu Leu His Gln Val Pro Ser Asn
Glu Leu His Gln Val Pro Ser Asn
Leu His Gln Val Pro Ser Asn
His Gln Val Pro Ser Asn
Gln Val Pro Ser Asn
Val Pro Ser Asn
Pro Ser Asn
Ser Asn
Asn



(sequence II) represents any one of sequences selected from a group containing (formula 3) and a group not
containing (formula 3)

a group containing (formula 3)

(formula 3) -Ala Asp Gly Thr Val Ala Ala
(formula 3) -Ala Asp Gly Val Ala Ala
(formula 3) -Ala Asp Gly Ala Ala
(formula 3) -Ala Asp Gly Xaa
(formula 3) -Ala Asp Thr Val Ala Ala
(formula 3) -Ala Asp Val Ala Ala
(formula 3) -Ala Asp Ala Ala
(formula 3) -Ala Asp Xaa
(formula 3) -Ala Thr Val Ala Ala
(formula 3) -Ala Val Ala Ala
(formula 3) -Xaa Thr Val Ala Ala
(formula 3) -Xaa Val Ala Ala
(formula 3) -Xaa Ala Ala
(formula 3) -Xaa Xaa
(formula 3) -Val Ala Ala
(formula 3) -Xaa
(formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu
Glu Leu Lys Phe Gln Gly Thr Val Ala Ala



a group not containing (formula 3)

Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala
Glu Ile Asp Lys Ser Lys Thr Val Ala Ala
Glu Ile Asp Lys Ser Lys Thr Ala Ala
Glu Ile Asp Lys Ser Lys Thr Xaa
Glu Ile Asp Lys Ser Lys Xaa
Glu Ile Asp Lys Ser Lys Val Ala Ala
Glu Ile Asp Lys Ser Lys Ala Ala
Glu Ile Asp Lys Ser Thr Val Ala Ala
Glu Ile Asp Lys Ser Val Ala Ala
Glu Ile Asp Lys Ser Ala Ala
Glu Ile Asp Lys Ser Xaa
Glu Ile Asp Lys Thr Val Ala Ala
Glu Ile Asp Lys Val Ala Ala
Glu Ile Asp Lys Ala Ala
Glu Ile Asp Lys Xaa
Glu Ile Asp Thr Val Ala Ala
Glu Ile Asp Val Ala Ala
Glu Ile Asp Ala Ala
Glu Ile Asp Xaa
Glu Ile Thr Val Ala Ala
Glu Ile Val Ala Ala
Glu Ile Ala Ala
Glu Ile Xaa
Glu Thr Val Ala Ala
Glu Val Ala Ala
Glu Ala Ala
Glu Xaa
Xaa

provided that Xaa represents any amino acid constituting a protein, formula 3 represents the following
sequence corresponding to 43-131 of uPA:

(formula 3)

Glu Ile Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr
Arg Gly Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp
Asn Ser Ala Thr Val Leu Gln Gln Thr Tyr His Ala His Arg Ser Asp
Ala Leu Gln Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Asn Pro Asp
Asn Arg Arg Arg Pro Trp Cys Tyr Val Gln Val Gly Leu Lys Pro Leu
Val Gln Glu Cys Met Val His Asp Cys

(Sequence III) represents a hydroxyl group (-OH) or any of the following amino acid sequences:

Gly Val Pro Gly Asp Gly Asp Glu Glu Leu Leu
Gly Val Pro Gly Asp Gly Asp Glu Glu Leu
Gly Val Pro Gly Asp Gly Asp Glu Glu
Gly Val Pro Gly Asp Gly Asp Glu
Gly Val Pro Gly Asp Gly Asp
Gly Val Pro Gly Asp Gly
Gly Val Pro Gly Asp
Gly Val Pro Gly
Gly Val Pro
Gly Val
Gly

4. The chimeric protein according to claim 3 wherein sequence II is

(formula 3) -Ala Asp Gly Thr Val Ala Ala
or
(formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu
Glu Leu Lys Phe Gln Gly Thr Val Ala Ala

when selected from a group containing (formula 3), and sequence II is

Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala
or
Glu Ile Asp Lys Ser Lys Thr Val Ala Ala

when selected from a group not containing (formula 3).

5. The chimeric protein according to claim 3 wherein sequence I is represented by Ser Asn Glu Leu His Gln Val Pro
Ser Asn.

6. The chimeric protein according to claim 3 wherein sequence I is represented by Ser Asn Glu Leu His Gln Val Pro
Ser Asn, and sequence II is

(formula 3) -Ala Asp Gly Thr Val Ala Ala
or
(formula 3) -Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro Glu
Glu Leu Lys Phe Gln Gly Thr Val Ala Ala

when selected from a group containing (formula 3), and sequence II is

Glu Ile Asp Lys Ser Lys Thr Thr Val Ala Ala
or
Glu Ile Asp Lys Ser Lys Thr Val Ala Ala

when selected from a group not containing (formula 3).

7. A DNA coding for a chimeric protein comprising a sequence of the following (formula 1) on 5' side and a sequence
of the following (formula 2) on 3' side:

(formula 1)

Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser
Asn Ile His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys

(formula 2)

Cys Asn Leu Pro Ile Val Arg Gly Pro Cys Arg Ala Phe Ile Gln Leu
Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe Pro Tyr Gly
Gly Cys Gln Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys Glu Cys Arg
Glu Tyr Cys



8. The DNA according to claim 7 coding for a chimeric protein comprising a sequence represented by (formula A):
N terminal-(sequence I)-(formula 1)-(sequence II)-(formula 2)-(sequence III)-C terminal (formula A)
in (formula A), (sequence I), (formula 1), (sequence II), (formula 2) and (sequence III) are as defined above.

9. A plasmid comprising DNA according to claim 7 or 8.

10. A transformant into which the plasmid according to claim 9 is introduced.

11. A cancerous metastasis inhibitor comprising the chimeric protein according to any of claims 1-6 as active
ingredient.

12. A method for producing a chimeric protein comprising introducing into a host cell a plasmid into which the DNA
according to claim 7 or 8 is integrated to produce a transformant, culturing the transformant, and recovering the
chimeric protein from a culture.

13. A method for prophylaxis of cancerous metastasis comprising administering a therapeutic amount of the chimeric
protein according to any of claims 1-6 to a patient of cancer.

14. The transformant according to claim 10 wherein said transformant is FERM BP-5293.

15. The transformant according to claim 10 wherein said transformant is FERM BP-5745.

16. The transformant according to claim 10 wherein said transformant is FERM BP-5746.

17. The protein according to claim 1 comprising an amino acid sequence which corresponds to 1-193 of SEQ ID NO 1.

18. The protein according to claim 1 comprising an amino acid sequence which corresponds to 1-200 of SEQ ID NO 2.

19. The protein according to claim 1 comprising an amino acid sequence which corresponds to 1-207 of SEQ ID NO 3.

20. The DNA according to claim 7 comprising a nucleic acid sequence which corresponds to 15-593 of SEQ ID NO 1.

21. The DNA according to claim 7 comprising a nucleic acid sequence which corresponds to 15-614 of SEQ ID NO 2.

22. The DNA according to claim 7 comprising a nucleic acid sequence which corresponds to 15-635 of SEQ ID NO 3.
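
The arrangement named (formula A) in the claims can be cross-checked against the residue ranges 1-193, 1-200 and 1-207 recited for SEQ ID NO 1 to 3. The sketch below writes the claimed blocks in one-letter code and joins them in the claimed order. The one-letter transcriptions, the dictionary keys and the pairing of each variant with a particular sequence II and sequence III are assumptions read off the sequence listing, offered for illustration only.

```python
# Building blocks transcribed to one-letter code (assumed reading of the claims).
SEQUENCE_I = "SNELHQVPSN"
FORMULA_1 = "CDCLNGGTCVSNKYFSNIHWCNCPKKFGGQHC"
FORMULA_2 = "CNLPIVRGPCRAFIQLWAFDAVKGKCVLFPYGGCQGNGNKFYSEKECREYC"
FORMULA_3 = ("EIDKSKTCYEGNGHFY" "RGKASTDTMGRPCLPW" "NSATVLQQTYHAHRSD"
             "ALQLGLGKHNYCRNPD" "NRRRPWCYVQVGLKPL" "VQECMVHDC")

def formula_a(sequence_i, sequence_ii, sequence_iii):
    """N terminal-(sequence I)-(formula 1)-(sequence II)-(formula 2)-(sequence III)-C terminal."""
    return sequence_i + FORMULA_1 + sequence_ii + FORMULA_2 + sequence_iii

variants = {
    # name: chimeric protein assembled from the claimed blocks
    "ATFHI": formula_a(SEQUENCE_I, FORMULA_3 + "ADGTVAA", "GVPG"),
    "ATFHI-CL": formula_a(SEQUENCE_I, FORMULA_3 + "ADGTVAA", "GVPGDGDEELL"),
    "ATFHI-ML": formula_a(SEQUENCE_I, FORMULA_3 + "ADGKKPSSPPEELKFQGTVAA", "GVPG"),
}

for name, protein in variants.items():
    print(name, len(protein))
```

The three assembled lengths can be compared with the residue ranges in claims 17 to 19; a mismatch would point to a transcription error in one of the blocks rather than to a property of the proteins.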
Fig. 1

[Figure: primary structure of urokinase (uPA); ATF, amino acids 1-135]

[Figure: carbohydrate chain annotation, partly illegible in the source]

Fig. 3

[Figure: structure of ATFHI. Residues 1-134: urokinase amino-terminal fragment; residues 135-193: HI-8 (1-59)]

ATFHI
Sequence Length : 193
Hydrophobicity 82 (42.49%)
Neutral 57 (29.53%)
Hydrophilicity 54 (27.98%)
Molecular Weight = 21564.13

Fig. 4

BamHI                                        TaqI

5'-(AG)GATCCAATCAAATGAGTAATGAACTACATCAAGTACCAT(CGA)-3'
        3'-GTTAGTTTACTCATTACTTGATGTAGTTCATGGTAGC-5'

S/D MSNELHQVP

Fig. 5

Pr-1
5'-CGTGAGCGACTCCAAAGGC
ATGAGAGCCCTGCTGGCGCGCCTGCTTCTCTGCGTCCTGGTCGTGAGCGACTCCAAAGGC 60
MRALLARLLLCVLVVSDSKG
-20 -15 -10 -5

AGCAATG-3'
AGCAATGAACTTCATCAAGTTCCATCGAACTGTGACTGTCTAAATGGAGGAACATGTGTG 120
SNELHQVPSNCDCLNGGTCV
1 5 10 15 20

TCCAACAAGTACTTCTCCAACATTCACTGGTGCAACTGCCCAAAGAAATTCGGAGGGCAG 180
SNKYFSNIHWCNCPKKFGGQ
25 30 35 40

CACTGTGAAATAGATAAGTCAAAAACCTGCTATGAGGGGAATGGTCACTTTTACCGAGGA 240
HCEIDKSKTCYEGNGHFYRG
45 50 55 60

AAGGCCAGCACTGACACCATGGGCCGGCCCTGCCTGCCCTGGAACTCTGCCACTGTCCTT 300
KASTDTMGRPCLPWNSATVL
65 70 75 80

CAGCAAACGTACCATGCCCACAGATCTGATGCTCTTCAGCTGGGCCTGGGGAAACATAAT 360
QQTYHAHRSDALQLGLGKHN
85 90 95 100

TACTGCAGGAACCCAGACAACCGGAGGCGACCCTGGTGCTATGTGCAGGTGGGCCTAAAG 420
YCRNPDNRRRPWCYVQVGLK
105 110 115 120

Pr-3
3'-CACGTACTGACGCGTCTACCATGGG-5'
CCGCTTGTCCAAGAGTGCATGGTGCATGACTGCGCAGATGGAAAAAAGCCCTCCTCTCCT 480
PLVQECMVHDCADGKKPSSP
125 130 135 140

CCAGAAGAATTAAAATTTCAGTGTGGCCAAAAGACTCTGA 540
PEELKFQCGQKTLRPRFKII
145 150 155 160

Pr-2
3'-GTGGTGGTAGCTCTTGGTCGGGACCAAA-5'
GGGGGAGAATTCACCACCATCGAGAACCAGCCCTGCT^ 600
GGEFTTIENQPWFAAIYRRH
165 170 175 180

Fig. 6

Pr-4
5'-GGGTACCGTTGCTGCTTGCAACCTGCCGATTGTCCG-3'
ACGGTTGCTGCTTGCAACCTGCCGGTTATCCGTGGTCCGTGCCGTGCTTTCATCCAGCTG 60
TVAACNLPVIRGPCRAFIQL
1 5 10 15 20

TGGGCTTTCGACGCTGTTAAAGGTAAATGCGTTCTGTTCCCGTATGGTGGTTGCCAGGGT 120
WAFDAVKGKCVLFPYGGCQG
25 30 35 40

Pr-5
3'-GGCACTTATAACGCCACAAGGCCCAACTAGTG-5'
AACGGTAACAAATTCTATTCTGAAAAAGAATGCCGTGAATATTGCGGTGTTCCGGGTGAC 180
NGNKFYSEKECREYCGVPGD
45 50 55 60

GAAGACGAAGAACTGCTGTGATGATCTAGAGCCCAGCCCGCCTAATGAGCGGGCTTTTTT 240
EDEELL*
65



Fig. 7

[Figure: plasmid construction scheme. PCR with primers Pr-1, Pr-2 and Pr-1, Pr-3 (pCR II); (1) DNA adaptor;
(2) BamHI-TaqI DNA adaptor; (3) TaqI-KpnI DNA fragment; (4) vector cleaved with HindIII and KpnI, BAP-treated;
(5) ligation; tac promoter; HindIII, BamHI, TaqI and KpnI sites]

Fig. 8

[Figure: plasmid construction scheme. PCR with primers Pr-4, Pr-5 (176 bp); (1) KpnI-BclI DNA fragment;
(2) vector (2681 bp) cleaved with KpnI and BamHI, BAP-treated; (3) ligation; BclI/BamHI junction]

Fig. 9

[Figure: plasmid construction scheme. (1) KpnI-XmnI DNA fragment (2356 bp); (2) KpnI-XmnI DNA fragment (997 bp);
(3) ligation; HI-8: 1-59; BclI/BamHI junction]

Fig. 10

Fig. 11

[Figure: plasmid construction scheme. EcoRI, KpnI, RsaI and NcoI sites; lacZα; blunt end; ligation (NcoI/BspHI);
OmpA signal cleavage site; KpnI complementary site; synthetic gene; synthetic OmpA signal]

Fig. 12

[Figure: structure of ATFHI-CL. Residues 1-134: urokinase amino-terminal fragment; residues 135-200: HI-8 (1-66)]

ATFHI-CL
Sequence Length : 200
Hydrophobicity 85 (42.50%)
Neutral 57 (28.50%)
Hydrophilicity 58 (29.00%)
Molecular Weight = 22335.90

Fig. 13

[Figure: construction of pAIP-CL (3374 bp). (1) 3323 bp DNA fragment cleaved with BsmI and XbaI;
(2) BsmI-XbaI adaptor encoding E Y C G V P G D G D E E L L STOP; (3) ligation]

Fig. 14

[Figure: structure of ATFHI-ML. Residues 1-147: urokinase amino-terminal fragment; residues 149-207: HI-8 (1-59)]

ATFHI-ML
Sequence Length : 207
Hydrophobicity 88 (42.51%)
Neutral 60 (28.99%)
Hydrophilicity 59 (28.50%)
Molecular Weight = 23117.86

Fig. 15

[Figure: plasmid construction scheme starting from pAIP (3353 bp; ATF: 1-134, HI-8: 1-59). (1) DNA fragment from
NcoI, ApoI cleavage; (2) DNA fragment from NcoI, KpnI cleavage; (3) ApoI-KpnI adaptor; (4) ligation]

Fig. 17

[Figure: plot; horizontal axis: ligand concentration (nM)]